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Enhanced yeast transcription employing hybrid promoter region constructs* 

© Yeast promoters of glycolytic enzymes are modified by 
isolating a fragment encompassing; the- RNA polymerase 
binding site and joining to the 5* end of this fragment a DMA 
sequence providing for enhanced inducible or constitutive 
transcription of a structural gene. Constructs are prepared 
for efficient expression of foreign genes in yeast 

Yeast strains 2150-2-3{pO/1GAPSOD) and AB110(pCt/ 
1GAPATO), producing human a,-entitrypsin and superoxide 
dismutase, were deposited at the A.T.C.C on May 9, 1984 
and given Accession Nos. 20708 and 20709, respectively; and 
2150-2-3(GAP5). 2150-2-3{Pyk5) and 21 50-2-3 (PH05GAP1). 
1^1 expressing Hepatitis B surface antigen, were deposited at the 

< A.T.C.C on May 9, 1984 and given Accession Nos. 20705, 
20706 and 20707, respectively. 
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ENHANCED YEAST TRANSCRIPTION EMPLOYING HYBRID 
PROMOTER REGION CONSTRUCTS 

BACKGROUND OF THE INVENTION 
Field of the Invention 

With the advent of hybrid DNA technology, 
production of mammalian proteins in microorganisms be- 
came a reality. For the first time, mammalian proteins 
could be synthesized in a unicellular microorganism by 
introduction of a gene encoding for the mammalian pro- 
tein under the transcriptional and translational control 
of regulatory sequences recognized by the microorganism 
host. The introduction of these foreign constructions 
into a microorganism host resulted in competition be- 
tween the regulatory signals of the construct and the 
regulatory signal endogenous to the host for the host 
systems involved with expression. The structural gene 
of interest is usually directed to a product which is 
nonproductive and may be injurious to the host. There- 
fore, host cells which can turn off the foreign gene 
can effectively dominate modified host cells. 

Substantial progress has been made in isolat- 
ing sequences concerned with transcriptional and trans- 
lational regulation for protein expression. But fre- 
quently flanking sequences, as well as distant sequences, 
may also affect the efficiency and regulation of the 
expression of the protein. Therefore, as one manipulates 
these various sequences, removing them from their native 
environment, and joining them to unnatural sequences, 
that is sequences other than the wild- type sequence, 
one can never be certain as to the result. 

In order to enhance the economies of producing 
proteins in microorganisms, there have been substantial 
efforts directed to improving the efficiency of tran- 
scription and translation, maximizing the proportion of 
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total protein directed to production of the desired 
product, enhancing the viability of the modified host, 
as well as improving the efficiency with which the modi- 
fied host may be obtained. 
5 Description of the Prior Art 

Guarente et al . , Proc. Natl. Acad. Sci. USA 
(1982) 79:7410-7414, describes a hybrid promoter region 
employing the GAL4 regulatory region. Guarente and 
Ptashne, ibid . (1981) 7B:2199-2203, report the presence 

10 of two domains in a yeast promoter, with a region up- 
stream from the TATA sequence providing an activation 
site. Kramer et al., ibid . (1984) 81:367-370, describe 
• the regulated expression of a human interferon gene in 
yeast employing the yeast acid phosphatase promoter, 

15 where expression is induced by phosphate or a tempera- 
ture shift. Tekamp-Olson et al . , Cold Spring Harbor 
Meeting, Molecular Biology of Yeast , 1983, describe the 
absence of deleterious effects on yeast viability when 
employing "short" promoters, as distinct from the pres- 

20 ence of such effects, when employing an extended promo- 
ter region or "long" promoters. 

SUMMARY OF THE INVENTION 
Novel hybrid promoter regions are provided 
for use in conjunction with constructs having a struc- 

25 tural gene under the transcriptional control of the 
hybrid promoter region and a terminator region. The 
hybrid promoter comprises a first segment providing 
transcriptional enhancement, either constitutive or 
regulated, and a second segment defining an SNA polymer- 

30 ase binding site and transcriptional initiation site. 

The hybrid promoter regions lack the deleterious effects 
associated with a wild-type promoter region in recombi- 
nant constructs, which results in reduced transformation 
efficiencies and lower yeast viability. 
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BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 is a diagrammatic view of plasmids 

GAP1-6; 

Figure 2 is a diagrammatic view of plasmids 
5 Pykl-6; and 

Figure 3 indicates the DNA linker sequence 
and a flow diagram showing its use in a construct for 
hSOD. 

DESCRIPTION OF THE SPECIFIC EMBODIMENTS 

10 Novel DNA fragments are provided as well as 

constructions employing the fragments for enhanced ex- 
pression of structural genes in a yeast host. The con- 
structs employing a hybrid promoter region provide for 
enhanced efficiencies of transformation and greatly 

15 improved viability of the yeast host as contrasted with 
those employing a wild- type yeast promoter. Concomitant 
with the improved viability is increased expression of 
a structural gene, in comparison with the truncated 
promoter region, and, therefore, greatly enhanced over- 

20 all yields of expression products. 

For the purposes of the subject invention, 
the "promoter region" is divided into two domains: 
(1) the structural gene proximal region, which includes 
the transcription initiation site, the "TATA" sequence 

25 capping sequence, as appropriate, and an RNA polymerase 
binding sequence, which sequence intends a sequence 
which includes nucleotides upstream from the initiation 
site for directing the initiation of synthesis of the 
messenger RNA; and (2) a distal region which provides 

30 for regulated or constitutive expression, with enhanced 
expression as compared to the first domain linked to 
non- functional yeast DNA. 

The hybrid promoters of the subject invention 
employ the RNA polymerase binding region of a yeast 

35 glycolytic enzyme promoter and a region upstream from 
said polymerase binding region, which is different from 
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the wild-type upstream region of the SNA polymerase 
binding region and provides for enhanced efficiencies 
of transcription. This distal region will be derived 
from either a sequence, usually a yeast sequence, in- 

5 volved in regulation of transcription, or a prokaryotic 
sequence which provides for enhanced constitutive ex- 
pression of the desired gene. 

Conveniently, cassettes or constructs can be 
prepared which provide for one or more restriction sites 

10 intermediate the promoter region and a related terminator 
region where the structural gene may be inserted, so as 
to be under the transcriptional control of the hybrid 
• promoter region. By having one or more restriction 
sites, one can provide for ease of insertion of the 

15 structural gene intermediate the transcription initia- 
tion and termination regions. The cassettes which can 
be prepared comprising the transcriptional initiation 
and termination region, having appropriate restriction 
sites for structural gene insertion can be cloned in 

20 prokaryotic vectors, so that after insertion of the 

structural gene, the resulting cassette, including the 
structural gene, may be cloned, isolated and purified, 
before introduction into a yeast vector. 

The cassette, will for the most part, have 

25 the following formula: 

-(P.R.(2)-P.R.(1))-R.S.-T.R.- 

wherein: 

P.R.(l) is the promoter region proximal to 
the structural gene and having the transcription initia- 
30 tion site, the RNA polymerase binding site, and includ- 
ing the TATA box, the CAAT sequence, as well as trans- 
lational regulatory signals, e.g., capping sequence, as 
appropriate; 

P.R.(2) is the promoter region joined to the 
35 5'-end of P.R.(l) associated with enhancing the 
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efficiency of transcription of the RNA polymerase binding 
region; 

R.S. is a sequence having one or more restric- 
tion recognition sites, preferably at least two restric- 
5 tion recognition sites, where the sites may result upon 
restriction into blunt ends or overhangs; 

T.R. intends the termination region, which 
will include the terminator, which may be a stem and 
loop structure, and which may be associated with one or 

10 more stop codons, a polyadenylation signal sequence, if 
any, as well as any other transcriptional and transla- 
tional termination sequences. 

P.R.(l) will generally be at least about 150bp, 
more usually at least about 20 Obp, usually not more 

15 than about 600bp, more usually not more than about SOObp, 
generally not more than about 450bp and preferably less 
than about 400bp; the sequence will extend in the down- 
stream direction of transcription to about bp +3, more 
usually bp -1 and may extend only to bp -20, more usual- 

20 ly to bp -10 (the numbering intends that +1 is the first 
bp of the initiation codon with which the promoter re- 
gion is associated in the wild-type host, while -1 is 
the immediately upstream bp and the integers increase 
in the direction of transcription; 

25 P.R.(l) will be derived from a strong yeast 

promoter, normally a glycolytic enzyme promoter, such 
as glyceraldehyde-3 -phosphate dehydrogenase, pyruvate 
kinase, alcohol dehydrogenase, phosphoglucoisomerase, 
triose phosphate isomerase, phosphofructokinase, etc.; 

30 P. R. (2) will provide for an enhancing function 

of transcription, which enhancing function may provide 
for constitutive or regulated transcription; regulators 
will be derived from regions associated with regulation 
of yeast genes, other than the natural or wild- type 

35 gene associated with the first domain in the wild-type 
or natural host, such as UDP-galactose epimerase (GAL10), 
galactokinase (GAL1), acid phosphatase (PH05), alcohol 
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dehydrogenase I and II (ADH I and II), etc. For yeast 
regulatory sequences, the domain will usually be at 
least about lOObp, more usually at least about 200bp, 
for convenience generally not exceeding about 3kbp, 
5 usually not exceeding about lkbp. desirably not exceed- 
ing about 600bp. The regulatory region will generally 
begin at least about 200bp from the initiation codon, 
usually at least about 300bp and may begin at 400bp or 
farther upstream from the initiation codon. 
10 Regulation can be as a result of a change in 

the chemical or physical environment of the host, such 
as a change in carbon source, e.g., glucose to galactose 
• or vice versa; a change in concentration of a nutrient, 
e.g., an inorganic nutrient such as a phosphate; or a 
15 change in temperature, e.g., 25°C to 35°C. Constitutive 
transcription can be achieved employing prokaryotic 
sequences of at least about 500bp, usually lkbp or more, 
for convenience, generally not exceeding about 5kbp; 
conveniently, the prokaryotic sequence can be obtained 
20 from the vector in which the cassette is cloned, illus- 
trative vectors including pBR322, lambda, Charon 4A, 
PACYC184, pUC5, etc. 

R.S. will generally be at least 4bp, more 
usually at least 6bp, and may be lOObp or more, more 
25 usually being not more than about 60bp and may include 
one or more, usually not more than about 10 restriction 
sites, where such restriction sites may be illustrated 
by EcoRI, BamHI, Sai l, Hindi II, Alu l, Aval, Tagl, Hgal, 
etc., having at least one unique restriction site for 
30 the construct sequences. 

T.R. is the termination region which will 
include the necessary transcriptional and translational 
signals for termination, such as the polyadenylation 
site, etc.; 

35 t.R. will generally be at least about lOObp, 

more usually at least 150bp, and generally less than 
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about Ikbp, usually less than about 600kbp; the termina- 
tion region may be derived from any convenient yeast 
sequence, so long as the terminator balances the promo- 
ter, conveniently being derived from a glycolytic enzyme 
terminator, where the terminator may be associated with 
the same or different enzyme with which the promoter is 
associated* 

Where a cassette is cloned in a bacterial 
vector, the construction will have the following formula: 

-(P.R.(2)-P.R.(l))-R.S.-T.R.-Hep(B)-(M(B)) a - 

I (-) 1 

b 

wherein all the symbols have been defined previously, 
except for: 

Rep (B), which intends a replicon or replica- 
tion system recognized by a prokaryotic host and may be 
derived from a plasmid or phage, such as ColEl, and R 
plasmid, e.g., pRK290, lambda, e.g., Charon 4A, Xdv, 
etc. ; 

M is a marker which provides for selection of 
hosts containing the construction, where (B) intends a 
prokaryotic, e.g., bacterial, host and a intends an 
integer of from 0 to 3, usually 1 to 2, although addi- 
tional markers may be present, where the marker allows 
for selection of the host containing the construct as 
well as providing for selective pressure on maintaining 
hosts having the construct; the markers include biocide 
resistance, such as antibiotic resistance, toxin resis- 
tance and heavy metal resistance; providing prototrophy 
to an auxotrophic host; providing immunity; and the 
like; 

the markers may provide for complementation 
of an auxotrophic host, e.g., his", ura~, trp~, leu* 
genotype, resulting in prototrophy; resistance to metals, 
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such as cup* genotype; resistance to antibiotics, such 
as amp r , tc r , cam r , str r , tur r genotype, etc.; 

b is (for 1, intending that the construction 
is either linear or circular, usually circular. 
5 The above construct can be used for insertion 

of a wide variety of structural genes, both prokaryotic 
and eukaryotic, both naturally occurring and synthetic, 
where the genes may include signal leaders for secretion, 
and the like. The genes may express enzymes, hormones, 
10 proteins from pathogens for vaccines, structural proteins, 
lymphokines, membrane surface proteins, immunoglobulins, 
blood proteins, or the like. The particular structural 
• gene which is inserted is not critical to this invention 
and any polypeptide or protein of interest may be pre- 
15 pared employing the constructions of the subject inven- 
tion. The structural genes will usually be foreign to 
the yeast host, where foreign intends different from 
wild-type yeast structural genes and from a source that 
does not normally exchange genetic information with 
20 yeast. 

Usually, the structural gene will be at least 
about 36bp, and not more than about 20kbp, usually not 
more than about 3000bp, usually not more than about 
ISOObp. Included in the structural gene may be non- 
25 coding flanking regions, the 5 1 -flanking region normally 
being quite short, usually less than about 30bp, while 
the 3' -flanking region may be extended, usually not 
exceeding about SOObp. Thus, the structural gene frag- 
ment will usually include the translational stop codons 
30 for proper termination of amino acid chain extension. 

When the structural gene has been inserted 
into the cassette which is joined to a yeast replication 
system, normally including one or more markers recog- 
nized by yeast, the resulting construct will have the 
35 following formula: 



r t 
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-(P.R.(2)-P.R.(l))-gene-T.R.-Rep(Y)-(M(Y)) - 

i Li 

wherein all of the symbols have been defined previously 
except for: 

5 gene, which intends the structural gene, having 

its initiation codon and stop codons as appropriate; 
and 

Y, which intends that the symbol is related 

to yeast. 

10 Convenient yeast replication systems include 

the 2pm plasmid replication system, combination of CEN3 
and ARS1 or ARS3, or the like. The replication systems 
may be high or low copy number, depending on the effect 
of the construct on the viability of the host* While 
15 the indicated replication systems are those which have 
found common employment, any replication system useful 
in yeast may be employed which provides for efficient 
replication and maintenance. Often the structural gene 
will be inserted into an appropriate shuttle vector 
capable of replication and selection in either a yeast 
or bacterial host, where the resulting construction 
will have the following formula: 

-(P.R.(2)-P.R.(l))-gene-T^R.-Rep(y)-(M(Y)) -Rep(B)-(M(B)) - 
I * 

25 where all symbols have been defined previously. Also, 
it is, of course, understood that the cassette without 
an inserted structural gene but containing the restric- 
tion enzyme recognition sequence, R.S., may be propagated 
in yeast or contained within a shuttle vector, where 

30 the construction will have the following respective 
formulae: 



20 



-(P.R.(2)-P.R.(l))-R.S.-T.R.-Rep(Y)-(M(Y)) - 

_ Li 
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-(P.R.(2)-P.R.(1))- R.S.-T.R.-Rep(Y)-(M(T)) a -Rep(B)-(M(B)) a - 



where all symbols have been defined previously. 

The various fragments which form the cassette 
5 and final constructions may be joined together in accor- 
dance with conventional ways. In many cases, genes 
have been isolated and restriction mapped, as well as 
sequenced. To that extent, one can select the sequence 
of interest by restriction of the gene, employing fur- 

10 ther manipulation as necessary such as resection with 

Bal 31, in vitro mutagenesis, primer repair, or the like, 
. to provide a fragment of a desired size, including the 
desired sequence, and having the appropriate termini. 
Linkers and adapters can be used for joining sequences, 

15 as well as replacing lost sequences, where the restric- 
tion site is internal to the region of interest. The 
various fragments which are isolated,, may be purified 
by electrophoresis, electroeluted, ligated to other 
sequences, cloned, reisolated and further manipulated. 

20 The use of regulatory sequences for control- 

ling transcription of the structural gene of interest 
allows for growing the host cells to high density with 
no or low levels of expression of the structural gene, 
and then inducing expression by changing the environmen- 

25 tal conditions, e.g., nutrient, temperature, etc. 

For example, with the GAL4 regulatory region, 
the yeast cells could be grown in rich media with a 
glycerol-lactic acid combination to high density, e.g., 
mid or late log phase, followed by switching the carbon 

30 source to galactose. For FH05 regulation one could 

grow the cells at high phosphate, about 1 to lOmM, and 
then decrease the phosphate concentration to about 0.1 
to 0.5mM. For temperature sensitivity, one could grow 
the cells at 25° to 37°C and then change the temperature 

35 as appropriate by about 5° to 20°C. The host cells 
would have the regulatory system associated with the 
regulatory region employed. 
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Various techniques will be exemplified in the 
Experimental section of this application, which tech- 
niques can be used as paradigmatic for constructions 
employing fragments from sources other than those exem- 
5 plified* Of particular interest, as evidenced by the 
Experimental section, will be the use of the glyceralde- 
hyde-3 -phosphate dehydrogenase promoter region for the 
RNA polymerase binding site in conjunction with regula- 
tor sequences, such as those associated with GAL4, PH05, 

10 ADHII or the like. In referring to the GAL4 regulatory 
region or associated sequence, the region intends the 
sequence associated with regulation of other galactose 
metabolism genes, e.g., GAL1 and GAL10, which are under 
the regulatory control of such sequence in conjunction 

15 with the expression product of the GAL4 gene. The FHOS 
and ADHII sequences refer to regions associated with 
the FH05 and ADHII genes which provide for transcrip- 
tional regulation of the PH05 and ADHII gene, respec- 
tively. 

20 The following examples are offered by way of 

illustration and not by way of limitation. 

EXPERIMENTAL 
All DNA manipulations were done according to 
standard procedures. See Molecular Cloning , T. Maniatis 

25 et al . , Cold Spring Harbor Lab., 1982. Enzymes used in 
cloning were utilized as per the manufacturer's speci- 
fications. Enzymes were obtained either from New England 
Biolabs or Bethesda Research Laboratories. Procedures 
with these enzymes employed the supplier's directions. 

30 Yeast were transformed and grown using a variety of 
media including selective medium (yeast nitrogen base 
without leucine); YEPD medium, containing 1% (w/v) yeast 
extract, 2% (w/v) peptone and 2% (w/v) glucose, and 
others as appropriate and/or detailed below. In the 

35 case of plating medium contained 2% (w/v) agar and for 
transformation 3% top agar. Hepatitis B surface antigen 
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was determined after lysis of yeast by glass bead agita- 
tion and clarification by centrifugation, using the 
Ausriall assay (Abbott Laboratories). Protein is deter- 
mined by the Coomassie dye binding method. 

5 construction of GAL regulator conta ining plasmids. 

Plasmid pLGSD5 is prepared as described in 
Guarente et al. , (1982) supra . The plasmid was manipu- 
lated as follows: After restriction with Xhol, the 
overhangs were filled in with the Klenow fragment of 
10 DNA polymerase I ("Klenow fragment"), ligated with EcoRI 
linkers (GGAAXTCC) and then completely digested with 
' EcoR I and Sau3 A to provide a 370bp fragment which was 
isolated by gel electrophoresis and included the inter- 
genic sequence between GAL1 and GAL10 genes of yeast, 
15 and provides for the GAL4 regulation sequence of the 
GAL1 and GAL10 genes. 

This fragment was inserted into pBR322 which 
had been completely digested with EcoR I and BamH I, fol- 
lowed by treatment with alkaline phosphatase to prevent 
20 oligomerization. The resulting plasmid pBRGAL4 was 
treated in two different ways. 

In the first procedure pBRGAL4 was completely 
digested with Sau3A, the overhangs filled in with the 
Klenow fragment, and the resulting blunt-ended fragment 
25 ligated with Sai l linkers (CGTCGACG), followed by diges- 
tion with Sail and Xho l. The resulting 370bp fragment 
was isolated by gel electrophoresis. This fragment has 
the original 370bp yeast GAL4 regulator sequence with 
Xho l and Sai l termini. 
30 The second fragment was obtained by complete 

digestion of pBRGAL4 with Xhol and Sail to provide a 
Xhol-Sall fragment which included the 370bp yeast GAL4 
regulator sequence as well as about 280bp of pBR322, 
the GAL4 sequence extending from Sau3A to Sai l. 



13 
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The two fragments were then cloned in the 
plasmid plot5. plots was prepared by inserting the 
40bp polylinker fragment of the following sequence 

EcoRI BamHI Ball I Xbal Hindlll 

, I I I T T 

5» AATTCCCGGGGATCCGTCGACCTGCAGATCTCTAGAAGCTTCAG 
3 1 GGGCCCCTAGGCAGCTGGACGTCTAGAGATCTTCGAAGTC 

III I 

Sma l Sai l Pst I PvuII 

into pBR322 as an EcoRI -PvuII substitution followed by 
insertion of the trp-lac promoter (Russell and Bennett, 
Gene (1982) 20:231-245) into the PvuII site with tran- 
scription oriented toward the polylinker sequence, 
plots was completely digested with Sai l , followed by 
treatment with alkaline phosphatase and the 370bp and 
650bp fragments independently inserted into plots to 
provide plasmids plot5GAL4/370 and plot5GAL4/650 , re- 
spectively. Each of the plasmids was then completely 
digested with BamH I and Sai l to reproduce the individual 
fragments extended by 6bp of the polylinker fragment. 
These fragments were then ligated into pCl/l, which had 
been completely digested with BamHI and Sail followed 
by treatment with alkaline phosphatase to prevent re- 
circularization. Plasmid pCl/l is a derivative of 
PJDB219 (Beggs, Nature (1978) 275:104) in which the 
region corresponding to bacterial plasmid pMB9 in 
pJDB219 has been replaced by pBR322 in pCl/l. The re- 
sulting plasmids were designated pCl/lGAL4/370 and 
PC1/1GAL4/650, respectively. The BamHI -Sail fragment 
is located in the pBR322 portion of the vector pCl/l. 

The next construction develops a hybrid pro- 
moter for expression of the Hepatitis B surface antigen 
(HBsAg or sAg), employing the RNA polymerase binding 
region of GAPDH. The previously prepared plasmid 
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pHBS56/16-3, a yeast shuttle vector containing the al- 
cohol dehydrogenase 1 (ADH1) promoter, the HBsAg gene 
and ADH terminator as a Sph I fragment, was digested 
with Sph I and the ends modified with Bam linkers. The 
5 Bam linkers have the sequence CGGATCCG. 

PHBS56/16-3 was prepared as follows: A 
Taq l- Bpa l fragment obtained from the HBsAg coding region 
which included 26bp of the pre-sAg region, 681bp of the 
sAg region and 128bp of the 3 * -untranslated region, was 

10 linked with EcoR I linkers and cloned at the EcoR I site 

in pBR322. The EcoR I linkers have the sequence GGAATTCC. 
The plasmid pHBS5 was thus obtained. 

After digesting pHBSS with EcoR I, the digest 
was resected with Bal3 1 and religated with EcoR I linkers 

15 (GGAATTCC). After digestion with EcoR I the material of 
about 800bp was isolated from a polyacrylamide gel. 
This isolate was then recloned into pBR322 which had 
been digested with EcoR I and treated with alkaline phos- 
phatase. Where the resection was to the sequence CATGG, 

20 which included the methionine codon, the EcoR I linker 
created an Nco l site. The plasmids were screened for 
the presence of an Nco l site and one of the plasmids 
chosen for further manipulation. This plasmid, desig- 
nated pHBS5-3, was restricted with EcoRI, the EcoRI 

25 fragment made blunt-ended with Klenow fragment and dNTPs, 
and the blunt-ended fragment was then restricted with 
Xbal to provide an about lOObp fragment having an Xba l 
overhang and blunt end at the former EcoRI site. 

pHBSS was then digested with Clal, made blunt- 

30 ended with the Klenow fragment and dNTPs, digested with 
Xba l, followed by alkaline phosphatase treatment. The 
lOObp fragment was then inserted into the vector to 
provide the plasmid pHBS6. Upon sequencing of the 
blunt-ended ligation site, it was found that an adeno- 

35 sine had been lost, so as to lose the EcoR I site, where 
the sequence was now ATCGATTCCCATGG. The Cla l and Nco l 
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sites were retained. The loss of the A resulted in 
pHBS6 having a single EcoR I site. 

PHBS5-3 was digested with EcoRI and the re- 
sulting EcoRI fragment having the sAg fragment isolated 
5 by gel electrophoresis and inserted into the vector 
PHBS16 (Valenzuela et al . , Nature (1982) 298:347-350). 
This plasmid has the ADH1 promoter and the sAg gene in 
an EcoRI fragment in a plasmid containing the 2pm origin, 
a TrpI gene and pBR322. The plasmid was digested with 

10 EcoRI, treated with alkaline phosphatase to prevent 

recircularization, and the EcoR I fragment from pHBS5-3 
inserted to provide pHBS16-3, where the sAg gene iso- 
lated as a Taq l- Hpa l fragment had been modified by Bal31 
resection. The plasmid pHBS16-3 was digested with Sph I 

15 and Xba l to provide a fragment which had the ADH promoter 
at the Sph terminus and the 5 '-end of the sAg gene. 

PHBS56 was then digested with Sph I. pHBS56 
was prepared from pCl/1 by digestion with Sph I, which 
deletes a portion of the plasmid spanning the 2um-pBR322 

20 joint. The active portion of the ADH1 promoter region 
is contained within the SphI -Hindi 1 1 fragment of approx- 
imately 300bp (Bennetzen et al. , J. Biol. Chem. (1982) 
257:301). The Sph I site in the ADH promoter begins at 
position -413 and the yeast terminator sequence is con- 

25 tained within a Hindlll-sphI fragment of about 330bp. 
In each case the Sph I site is distal to the coding re- 
gion. A 1500bp ADH1 promoter fragment terminating at 
position -9 (Hitzeman et al. , Nature (1981) 293 :717) 
and an approximately 450bp terminator unit from nucleo- 

30 tides 913 to 1368 in the ADH gene nucleotide sequence 
were joined at a Hindi I I site between the fragments and 
cloned into the BamHI site of the vector YEpl3 (Broach 
and Hicks, Gene (1979) 8:121) to provide pADH5. 

The HBsAg-DNA segment of pHBS5 was excised by 

35 EcoRI digestion, blunt-ended with the Klenow fragment 
and joined at both ends with Hindi I I linkers, CAAGCTTG. 
After digestion with Hindlll, the HBsAg fragment was 
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inserted into the Hindi 1 1 site of the plasmid pADH5 
which had been .digested at the Hindi 1 1 site intermediate 
the ADH1 promoter and terminator sequence. A plasmid 
with the HBsAg gene in the correct orientation as deter- 

5 mined by restriction analysis was designated pHBS22. 
The cassette was included between two Sph I restriction 
sites. pHBS22 was digested with Sph I to obtain a frag- 
ment of about 1500bp and inserted into Sph I digested 
pCl/1 to provide pHBS56 which was cloned in E. coli 

10 HB101. 

pHBS56 was digested with Sph I and Xba l to 
provide a l.lkb fragment having the ADH terminator re- 
• gion and the 3' -portion of the sAg gene with the SphI 
site proximal to the terminator region. The l.lkb 

15 Sph l- Xba l fragment was joined to the SphI -Xbal fragment 
from pHBS16-3, which resulted in providing the complete 
sAg gene in the correct orientation between the ADH 
promoter and terminator. This SphI -SphI fragment was 
then ligated to Sph I digested pHBS56, replacing the 

20 cassette of pHBS56 to provide the plasmid pHBS56/16-3 
with the resected sAg coding region fragment. The cas- 
sette was then excised from pHBS56/16-3 by digestion 
with SphI, followed by chewing back the overhangs with 
the Klenow fragment in the presence of dNTPs, then li- 

25 gated with BamHI linkers, followed by digestion with 

BamHI to provide a 1.6kb fragment which was isolated by 
gel electrophoresis. The fragment included the ADH 
promoter region, the sAg gene and ADH terminator region, 
as described above. This fragment was inserted into 

30 the BamHI site of pBR322 to provide pPGT16-3 which was 
digested with BamH I and Xba l and the resulting l.lkb 
fragment gel isolated, where the Xbal -BamHI fragment 
had the 3' portion of the sAg gene and the ADH ternary 
tor region. 

35 pHBS6 was digested with Xba l and Nco l and the 

94bp fragment gel isolated to provide the 5 '-portion of 
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the sAg gene. A synthetic adapter was prepared of the 
formula 

CGA 2 TA 3 ( CA ) 3 TA 3 CA 3 CAA 
T 2 AT 3 * CT * 3 ATjCTa 0110 ™ 

5 having TagI and Ncol termini and providing the -25 to 
-1 nucleotides of the GAPDH (GAP49) promoter and the 
initiation codon of the sAg gene. This synthetic frag- 
ment, the Nco l- Xba l fragment, and the Xba l - BamH I frag- 
ment were ligated simultaneously, followed by digestion 
10 with Tag I and BamH I. The resulting fragment was then 
substituted into pBR322 linearized with Cla l and BamH I, 
" followed by treatment with alkaline phosphatase. The 
resulting plasmid, which contains the -1 to -25bp of 
the GAPDH promoter region, the sAg gene, and the ADH 
15 terminator, where the Nco l restriction site is lost was 
called pHBSSLGAPsAgtADH. 

pGAPl, a plasmid prepared by insertion of a 
Hind i 1 1 fragment containing the GAPDH gene GAP49 (Holland 
and Holland, J. Biol. Chem. (1979) 254x5466-5474) in- 
20 serted in the Hindi 1 1 site of pBR322, was digested with 
Hin fl and a 500bp promoter containing fragment isolated. 
The fragment was resected with Bal3 1 to remove about 50 
or 90bp, followed by ligation with Hindi 1 1 linkers and 
digestion with Hind i II . pBR322 was digested with 
25 Hindi II, followed by treatment with alkaline phosphatase 
and the about 450 or 410bp fragment inserted to provide 
pGAP128 and pGAP396, respectively. 

pGAP128 was digested with Hind lll, the fragment 
made blunt-ended with the Klenow fragment and dNTPs and 
30 the resulting 450bp fragment isolated by gel electropho- 
resis. This fragment was inserted into Sma l digested 
plots, which had been treated with alkaline phosphatase, 
to provide plasmid plot5pGAP128, which contained about 
-400 to +27bp of the GAPDH promoter and coding region. 
35 Plasmid plot5pGAP396 was prepared from pGAP396 in an 
identical manner and thus differs from plasmid 
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plot5pGAP128 in having about 15-30 fewer bp at each 
terminus of the GAPDH promoter region (about -385 to 
-3). 

Plasmids GAP1-GAP4 were then prepared in the 
5 following manner. Plasmid plot5pGAP128 was digested 

with TagI and BamHI to provide an about 390bp Tagl-BamHI 
fragment which included the -26 to about -400bp of the 
GAPDH promoter region and a portion of the Hindi 1 1 and 
plots polylinker. pHBS6LGAPsAgtADH plasmid was also 
10 digested with Tag I and BamHI and a l.lkb TagI -BamHI 

fragment containing the 3' -terminus of the GAPDH promo- 
ter region, the sAg gene and the ADH terminator region 
• was gel isolated and ligated to the other TagI -BamHI 
fragment to provide a BamHI -BamHI fragment which included 
15 approximately 400bp of the GAPDH promoter region, the 

sAg gene in proper orientation for transcriptional regu- 
lation by the GAPDH promoter, followed by the ADH termi- 
nator region. This fragment was ligated into pBR322 
which had been digested with BamHI and treated with 
20 alkaline phosphatase to provide plasmid pPGT80. This 
BamHI cassette could now be isolated and inserted into 
plasmid pCl/1, at the BamHI site in the pBR322 portion 
of pCl/1, where in plasmid GAP1 the ADH terminator region 
is proximal to the amp r gene with the pBR322 portion 
25 divided into an approximately 4kb sequence including 

the amp r gene and a 375bp region separating the cassette 
from the 2um sequences. In GAP2, the promoter is adja- 
cent to the long pBR322 sequence with transcription in 
the same direction as the amp r gene. The same cassette 
30 was inserted into BamHI -digested pCl/lGAL4/650 to obtain 
plasmids GAP3 and GAP4, where GAP3 has the GAPDH promoter 
distal from the GAL4 regulator region and the long pBR322 
sequence and GAP4 has the GAPDH promoter adjacent to 
the GAL4 regulator region, which is adjacent to the 
35 long pBR322 sequence. 

Plasmids GAPS and GAP6 were isolated as follows. 
Plasmid plot5pGAP396 was digested with Sail and TagI 
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and a fragment containing 9bp of the plots polylinker 
sequence and the GAPDH promoter sequence extending from 
about -385 to -26bp was isolated. An approximately 
130bp Tagl-Xbal fragment including -25 to -lbp of the 
GAPDH promoter and +1 to +93bp of the sAg gene was ob- 
tained from pHBS6LGA?sAgtADH. A l.lkb Xbal -Sal I fragment 
containing the 3 '-portion of the sAg gene and the ADH 
terminator as well as 6bp of plot5 polylinker sequence 
was obtained from plasmid plot5sAgtADH (described below- 
— Pyruvate Kinase Promoter). These three fragments 
were ligated, digested with Sai l and then cloned into 
Sail -digested pCl/lGAL4/370 . GAP5 has the GAPDH prorao- 
• ter region adjacent to the GAL4 regulator region, which 
is proximal to the short pBR322 sequence, and GAP6 has 
> the GAPDH promoter region distal from the GAL4 regulator 
region and proximal to the long pBR322 sequence (see 
Fig. 1). 

Pyruvate kinase promoter . 

Plasmid pHBS6Pyk containing the sAg gene under 
0 the transcriptional regulatory control of the Pyk promo- 
ter was obtained by cloning a 4.4kb insert of yeast 
genomic DNA in pBR322 containing the Pyk gene and 911 
nucleotides of 5 ' -untranslated region, and digestion of 
this plasmid pPyk9.1.1 with Xbal. After making the 
25 ends blunted-ended, the linear fragment was digested 
with BamHI providing a 912bp BamHI -blunt fragment con- 
taining the Pyk promoter and 8 bases from the Pyk coding 
region. This fragment was inserted into the plasmid 
pHBS6, which had been digested with Ncol, blunt-ended 
30 and digested with BamHI . The plasmid pHBS6Pyk was to- 
tally digested with EcoRI, to obtain a fragment includ- 
ing the sAg gene and a portion of the Pyk promoter re- 
gion. The fragment was made blunt-ended with the Klenow 
fragment and dNTPs, followed by ligation to BamHI linkers 
35 digested with Xbal, which is internal to the sAg gene, 
the Xbal terminus made blunt-ended with the Klenow 
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fragment and dNTPs, followed by digestion with Banal, 
to provide a 5B0bp BamHI -blunt- ended (»al) fragment. 
The plasmid plots was digested with EcoRI, made blunt- 
ended, digested with BamHI and treated with alkaline 
phosphatase and the two fragments joined to provide 
plasmids P lot5PyksAg51 and plotSPyksAg.57. The two 
differ in that the BamHI site of the latter was not 
regenerated during cloning, possibly as a consequence 
of minimal nuclease contamination (digestion). 

plots was treated as previously described 
(EcoRI digestion, blunt-ended, BamHI digestion and treat- 
ment with alkaline phosphatase) and joined to a l.lkb 
• fragment obtained by digestion of pPGT16-3 with fflal, 
followed by blunt ending, followed by digestion with 
i BamHI and gel isolation. This fragment was introduced 
into plots to provide the plasmid plots sAgtADH. Again 
the BamHI site in this plasmid was not regenerated, 
presumably due to digestion by contaminating nuclease. 

Plasmids Pykl and Pyk2 were prepared as fol- 
3 lows. Plasmid plot5PyksAg51 was digested with BamHI, 
then with Xbal, and an approximately 580bp fragment 
containing about 480bp of Pyk promoter and 93bp of the 
5 '-end of the sAg gene was gel isolated. A l.lkb Xbal- 
Sal l fragment containing the 3 '-portion of the sAg gene, 
5 the ADH terminator and about 6bp of the plots polylinker 
was isolated from plotSAgtADH . These two fragments 
were ligated, digested with BamHI and Sail and then 
cloned into plasmid pCIA, which had been cleaved with 
BamHI and Sail and treated with alkaline phosphatase, 
30 to yield plasmid Pykl. Plasmid Pyk2 was prepared simi- 
larly but the 580bp Sal I -Xbal, Pyk promoter/HBsAg gene 
5 '-end fusion fragment was isolated from plot5PyksAg.57 
and included about 6bp of plots polylinker sequence 
upstream from the promoter region. Also the l.lkb 3fcal- 
35 BamHI fragment containing the 3 '-part of the HBsAg gene 
and the ADH terminator was derived from plasmid pPGT16-3 
Plasmids Pyk3-Pyk6 were prepared as follows. 
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Plasmid plotSPyksAgSl was digested with BamHI, then 
with Xba l and the about 580bp fragment containing the 
Pyk promoter and the S^part of the HBsAg gene isolated 
as above. The l.lkb BamHI-Xbal fragment, containing 
5 the 3 '-portion of the HBsAg gene and ADH terminator, 
was recovered from pPGT15-3, also as above, and the two 
fragments ligated, digested with BamH I and inserted 
with different orientations into the BamH I site of 
PC1AGAL4/650 (Pyk3, Pyk4). Plasmids Pyk5 and Pyk6 

10 were prepared similarly except that the Sall- Xba l frag- 
ment containing the Pyk promoter and 5 f -end of the sAg 
gene was isolated from plot5PyksAg.57 and the Xba l- Sal l 
' sAg gene 3 1 -portion/ADH terminator fusion fragment was 
derived from pi otS s AgtADH and thus both fragments in- 

15 eluded approximately 6bp of plots polylinker sequence. 
The cassette so formed was then cloned into the Sai l 
site of pCl/lGAL4/370 in opposite orientations. 

The six plasmids designated Pykl-6 (see Fig. 
2) are distinguished by Pykl having the promoter region 

20 proximal to the short pBR322 sequence; Pyk2 having the 
promoter region proximal to the long pBR322 sequence; 
Pyk3 having the promoter region proximal to the short 
pBR322 sequence and distal from the GAL4 sequence; while 
Pyk4 has the promoter region proximal to the GAL4 region, 

25 which in turn is proximal to the long pBR322 sequence; 
Pyk5 has the promoter region proximal to the GAL4 region 
which is proximal to the short pBR322 sequence; while 
Pyk6 has a promoter region distal from the GAL4 region 
and proximal to the long pBR322 sequence. 

30 These plasmids described above were trans- 

formed into S. carlsbergensis strain 2150-2-3 (available 
from Lee Hartwell, University of Washington) under con- 
ventional conditions (Hinnen et al . , Proc. Natl* Acad. 
Sci. USA (1978) 75:1929-1933). Cultures of 50-150ml 

35 were grown to mid or late log phase in rich media (YEP) 
under neutral conditions (3% glycerol, 2% lactic acid), 
and then inducing conditions (+2% galactose), or 
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repressing conditions (+2% glucose) for the final 1-2 
generations. After lysis with glass beads and clarifi- 
cation of the supernatants by centrifugation, HBsAg 
expression was determined as described above. The re- 
sults for the 12 plasmids are set forth in the following 
Table 1. 
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Construction of pPGAP . 

A yeast expression vector was prepared called 
pPGAP having a polyrestriction site linker between the 
GAPDH terminator and short promoter region. Plasmid 
plot5pGAP128 was digested with BamHI and Tagl to yield 
an approximately 390bp BamHI -Tag I fragment having the 
-400 to -26bp of the GAPDH promoter. The BamHI -TagI 
fragment was ligated to a synthetic fragment having the 
following sequence: 

CGAjTAg (CA) 3 TA 3 CA 3 CACCATG 3 A 2 T 2 CGT 2 AG 2 
T 2 AT 3 (CT) 3 AT 3 GT 3 GTGGTAC 3 T 2 A 2 GCA 2 TC 2 AGCT 

to provide a BamH I- Sal l fragment, which was digested 
with BamH I and Sai l and used to replace the BamHI -Sal I 
fragment of BamHI -Sal I digested pBR322 treated with 
alkaline phosphatase. After ligation, the plasmid 
pGAPNRS was obtained which was digested with BamHI and 
Sail to provide a 400bp BamHI -Sail fragment which was 
gel isolated. This fragment was ligated to an about 
900bp Sal I -BamHI fragment containing the GAPDH termina- 
tor region and a short segment of 3 f coding region and 
the resulting 1.4kb BamH I - BamH I fragment digested with 
BamH I. The Sal I - BamH I GAPDH terminator fragment was 
obtained by Sai l and BamH I digestion of pGAP2, a plasmid 
prepared by insertion of an about 3.3kb BamH I fragment 
containing the GAPDH gene GAP49 (Holland and Holland, 
supra ) into the BamH I site of pBR322. Plasmids pGAP2 
and pGAPl were obtained as follows: A yeast gene li- 
brary was prepared by inserting fragments obtained after 
partial digestion of total yeast DNA with restriction 
endonuclease Sau3A in lambda-phage Charon 28 (Blattner 
et al. , Science (1977) 196:161-169). The phage library 
was screened with DNA complementary to the yeast GAPDH 
mRNA and the yeast GAPDH gene from one of these clones 
was subcloned as either an about 3.3kb BamHI fragment 
in the BamH I site of pBR322 (pGAP-2) or as an about 
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2.1kb Hind i 1 1 fragment in the Hind i 1 1 site of pBR322 
(pGAP-1). 

pBR322 was digested with EcoR I and Sai l, the 
termini blunt-ended and ligated to BamHI linkers, fol- 
lowed by BamH I digestion and the BamHI -BamHI 3.8kb frag- 
ment gel isolated, recircularized by self-ligation, 
cloned and designated pBRARl-Sal. The 1.4kb BamH I - BamH I 
fragment was inserted into the BamH I -digested . alkaline 
phosphatase treated pBRARl-Sal vector to provide the 
plasmid pPGAP of about 5.3kb with the orientation in 
the opposite direction of the amp r . 

The plasmid phSOD was prepared as follows: 

Molecular cloning of hSOD cDNA . 

Total RNA was prepared from an adult human 
liver by the guanidinium thiocy anate /I i thium chloride 
method (Cathala et al., DNA (1983) 2:329-335). polyA 
RNA was used to synthesize double- stranded cDNA (Maniatis 
et al. , Molecular Cloning , 213-242, Cold Spring Harbor, 
1982) and this was passed over a Sepharose CL4B column 
to enrich for cDNAs of greater than 350bp (Fiddes and 
Goodman, Nature (1979) 281:351-356). The cDNA was in- 
serted at the PstI site of plot4, a pBR322 derivative 
having the following sequence replacing the Pst I - EcoR I 
site. 

Pst I Hinfl Alu l 
1 GGTGAATCC6TAATCATG6TCATAGCT6TTTCCTGTGT6AAATTGTTATCCGCTC 
ACGTCCACTTAGGCAITAGTACCAGTATCGACAMGGACACACTTTAACAATAGGCGAG 
HphI 

HindllAluI 

60 ACMTTCCACA(^TTATACGAGCCGATGATTAATTGTCMCAGCTCATTTCAGAATATn 
TGTTMGGTGTGTAATATGCTCGGCTACTAATTAACAGTTGTCGAGTAMGTCTTATAAA 



EcoRI 

120 GCCAGAACCGTTATGATGCGG 

CGGTCTTGGCAATACTACGCCTTAA 
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The cDNA insertion employed the oligo-dG:dC tailing 
method (Maniatis et al., supra ), B. coli strain D1210 
was transformed with this mixture and trans formants 
selected on L-agar containing lOpg/ml tetracycline 
5 (Kushner, S.R. (1978) In: Genetic Engineering , eds. 
Boyer, H.B. and Nicosia, S., (Elsevier/North Holland, 
Amsterdam) p. 17). Plasmid DNA constituting a liver 
cDMA library was prepared (Maniatis et al. , Molecular 
Cloning , pp. 86-94, Cold Spring Harbor 1982) directly 
10 from approximately 62,000 recombinant colonies plated 
at a density of approximately 3,000 colonies per 9cm 
diameter Petri dish. 

Isolation of r-hSOD clones . 

Strain D1210 was re trans formed with the liver 

15 cDNA library and about 40,000 clones were grown on nine 
14cm diameter Petri dishes. After transfer of the col- 
onies to nitrocellulose paper and chloramphenicol ampli- 
fication of plasmid DNA, the cells were lysed and the 
filters prepared for hybridization (Ish-Horowicz and 

20 Burke, Nucleic Acids Research (1981) 9:2989-2998). 

Oligonucleotide probes were employed for screening by 
hybridization, with the probes consisting of enzymatic- 
ally-radiolabeled, chemically-synthesized DNA molecules 
complementary to the mRNA encoding amino acid residues 

25 19 to 24 of the protein (Jabusch et al . , supra . ; Barra 

et al. , supra . ); the mixture had the following sequences: 

3* TTA AAA CTT GTT TTT CT 5 f 
G G C C C 

where all of the indicated possibilities for encoding 
30 the peptide sequence were prepared (32-fold degenerate). 

The probes were labeled with 32 P to a specific 
activity of l-3xl0 8 cpra/|jg and Millipore (0.45pm) filtered 
before use. Filters were prehybridized for 6hrs at 
30°C in 4x SSC, 2x Denhardts's solution, 40znM sodium 
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phosphate , pH 7.5, 300pg/ml sonicated salmon testes 
DNA. Hybridization was for 20hrs at 30 °C in the same 
solution containing 2xl0 6 cpm/ml hSQD DNA probe (residues 
19-24). Filters were washed in 4x SSC, once for 15min 
at r.t. and twice for 15min at 30°C, blotted dry and 
autoradiographed with an intensifying screen for 24hrs 
at -70°C 

Areas on the master plates that corresponded 
to duplicate positive signals were picked into L-broth 
and plasmid DNA prepared by the miniscreen procedure 
(Maniatis et al . , Molecular Cloning , 178, 368-369, Cold 
Spring Harbor 1982 ) • This DNA was cut with PstI and 
subjected to Southern blot analysis (Southern, J. Mol. 
Biol , (1975) 98:503-517) hybridizing initially with the 
previous labeled probes (amino acid residues 19-24) and 
then with additional radiolabeled probes derived from 
amino acid residues 109-114 and having the following 
sequences (all possible variations, 72-fold degenerate) 
present as a mixture: 

3« CTA GTA ACA TAA TAA CC 5» 
G G G G G 
T T 

One plasmid pool (pSODl) contained a cDNA insert of 
520bp that hybridized with both probes and after colony 
purification, plasmid DNA was prepared from this clone 
and sequenced by the method of Maxam and Gilbert ( Proc. 
Natl. Acad. Sci. USA (1977) 74:560-564). The hSOD cDNA 
clone pSODl constitutes the coding region for amino 
acids 10-153 of hSOD, a single translational stop codon 
and a 3 1 untranslated region. Therefore, in the expres- 
sion vector construct, the base sequence of the region 
encoding amino acids 1-9 is derived from the published 
amino acid sequence of hSOD (Jabusch et al., supra ; 
Barra et al . , supra ) and synthesized chemically as a 
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part of the variable linker segment (see discussion 
relating to Fig. 3). 

Construction of plot5 derivatives co ntaining r-hSOD. 

The synthetic DNA molecules F(26), C(16), 
B(31), D(U), E(13) and 4(24) shown in Fig. 3, were 
synthesized by the phosphoramidite method. 

The single strand 4(24) was prepared by using 
all four bases, at each site where X is indicated. 
Furthermore, silica was withdrawn from the synthesis of 
the 24mer, such that single-stranded 21mers, 22mers, 
and 23mers are obtained in addition to the 24mers. 
After removal from the silica support, the four mixtures 
are combined in appropriate proportions to provide for 
eguimolar amounts of each of the possible single strands. 
This mixture was treated as a single product in the 
subsequent steps. 

Molecules F(26), C(16), B(31) and D(ll) were 
mixed together in eguimolar amounts and lOug phosphory- 
lated using T4 polynucleotide kinase. After phenol-ether 
extraction, the additional non-phosphorylated synthetic 
DNA molecules 4(24) and E(13) were added, such that all 
fragments were eguimolar. The eguimolar mixture con- 
tained 13ug of DNA in 133ul of 0.3x kinase buffer. 

After annealing by cooling at a uniform rate 
from 70°C to 20°C over 60min, the single strands were 
ligated together with T4 ligase in 200ul ligation mix 
at 14°C for 4hrs, phenol-chloroform extracted, ethanol 
precipitated and the 5' -ends of 4(24) and E(13) phospho- 
rylated using T4 polynucleotide kinase (Maniatis et 
al . , supra ) . Preparative polyacrylamide gel electro- 
phoresis was used to isolate the completely ligated 
53bp material having 5'- and 3 • -overhangs . 

The above purified fragment mixture was then 
ligated to the 460bp Taql-PstI segment of the hSOD cDNA 
as shown in Fig. 3. This segment was itself constructed 
by isolating the 454bp Tagl- Alu l hSOD fragment, making 
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it flush-ended using Klenow and inserting it into plots 
between its EcoRI and Sai l sites which had been simi- 
larly made flush-ended. After preparation of plasmid 
DNA from this recombinant, the 460bp Taq l- Pst l hSOD 
5 fragment was isolated by preparative polyacrylamide gel 
electrophoresis. After extraction and precipitation, 
the 515bp fragment resulting from the joining of the 
synthetic fragment to the 460bp Tagl-PstI hSOD fragment 
was blunt-ended (525-528bp> and then digested with Sai l 
10 and the resulting 519-522bp hSOD fragment isolated by 
polyacrylamide gel electrophoresis. This fragment was 
then inserted into plots which had been digested with 
" PvuII and Sail and then treated with alkaline phospha- 
tase. The resulting plasmids were used to transform 
15 strain D1210. Recombinants obtained after transforma- 
tion of strain 01210 were selected on L-agar containing 
lOOpg/ml ampicillin to give a set of clones, which were 
screened for an Nco l site. One was selected and desig- 
nated phSOD. 

Construction of a yeast vector for SOD expression . 

The plasmid phSOD was digested with Nco l and 
Sail and a 550bp fragment obtained, which included 1 
nucleotide untranslated at the 5 1 -terminus and the entire 
coding region for hSOD. pPGAP was digested with Nco l 
and Sail followed by treatment with alkaline phosphatase 
and the Sal l -Kco l fragment substituted for the Ncol -Sal I 
fragment in pPGAP to provide pPGAFSOD. BamH I digestion 
of pPGAPSOD resulted in a 2kb fragment which was gel 
isolated and inserted into the BamHI site of pCl/1 and 
pCl/1 GAL4/370. These plasmids were transformed into 
yeast strain 2150-2-3 as described previously, with the 
results of expression set forth in the following Table 
2. 
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Table 2: Expression of Human SOD in Yeast Strain 2150 



Plasmid 


Carbon Source 


SOD 2 
Hg/mg protein 


pCl/1 


g, I 1 


0 


pCl/lGAFSOD 


g, L 


148 


pCl/lGALGAPSOD 


8, L 
gal 


0.4 
68 



1 All cultures grown in Minus Leucine media with 2% lactic acid, 
3% glycerol with or without 2% galactose to late log or early 
.stationary phase. 



Determined by RIA. 



hSOD levels were measured using a standard radioimmuno- 
assay with iodinated authentic hSOD as standard. Con- 
stitutive synthesis from the GAP promoter leads to very 
high levels of hSOD production, of the order of 10-30% 
of the total cell protein. The induction with galactose 
works almost as well, yielding about 7% of the cell 
protein as hSOD. 

Construction of pYASIl 

This yeast expression plasmid contains the 
hSOD gene fused to the amino terminus of the human pro- 
i&sulin gene, with a methionine codon at the junction 
between both genes. The fusion gene is under control 
of the hybrid inducible ADH2-GAP (yeast alcohol dehydro- 
genase 2) promoter and the GAP terminator. An about 
3kbp BamH I expression cassette was constructed. 

The ADH2 portion of the promoter was con- 
structed by cutting a plasmid containing the wild type 
ADH2 gene (plasmid pADR2, see Beier and Young, Nature 
(1982) 300:724-728), a pBR322 derivative, which contains 
the yeast centromere gene from chromosome 3 (CEN3) (U.S. 
Patent No. 4,464,472), the Trpl gene and the yeast ADHII 
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gene with the restriction enzyme EcoR S , which cuts at a 
position +66 relative to the ATG start codon, as well 
as in two other sites in pADR2, outside of the ADH2 
region in the CEN 3 and Trpl gene. The resulting mixture 
5 of a vector fragment and two smaller fragments was re- 
sected with Bal3 1 exonuclease to remove about 300bp. 
Synthetic Xho l linkers were ligated onto the Bal3 1 
treated DMA. The resulting DNA linker vector fragment 
was separated from the linkers by column chromatography, 

10 cut with the restriction enzyme Xho l, religated and 
used to transform E. coli to ampicillin resistance. 
The positions of the Xho l linker additions were deter- 
* mined by DNA sequencing. One plasmid which contained 
an Xho l linker located within the 5 f non- transcribed 

15 region of the ADH2 gene (position -232 from ATG) was 
cut with the restriction enzyme Xho l, treated with nu- 
clease SI, and subsequently treated with the restriction 
enzyme EcoR I to create a linear vector molecule having 
one blunt end at the site of the Xho l linker and an 

20 EcoR I end at the pBR322 proximal terminus. 

The GAP portion of the promoter was constructed 
by cutting plasmid pPGAP (supra) with the enzymes BamHI 
and EcoR I , followed by the isolation of the 0.4Kbp DNA 
fragment. The purified fragment was cut with the enzyme 

25 Alu l to create a blunt end near the BamHI site. 

Plasmid pJS014 was constructed by the ligation 
of the Alul -EcoRI GAP promoter fragment to the ADH2 
fragment present on the linear vector described above. 

Plasmid pJS104 was digested with BamHI (which 

30 cuts upstream of the ADH2 region) and with Nco l (which 
cuts downstream of the GAP region). The about 1.3Kbp 
fragment containing the ADH2-GAP promoter was gel puri- 
fied and ligated to an about 1.7Kbp fragment containing 
the hSOD-proinsulin fusion DNA sequences and GAP termi- 

35 nator present in pYSIl. This 3Kbp expression cassette 
was cloned into BamH I digested and phosphatase treated 
pCl/1 to yield pYASIl. 
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Plasmid pYSIl was constructed as follows. 
Three fragments were employed which involve a 454bp 
NcoI-Sau3A isolated from phSOD, where the fragment in- 
cludes the entire coding sequence for human superoxide 
5 dismutase (hSOD) with the exception of the last three 
3»- codons; a 51bp Sau3A-HindIII synthetic adapter, 
which codes for the last three codons of hSOD, methio- 
nine, and the first 14 codons of proinsulin; and a 231bp 
Hindlll-Sall fragment, isolated from pINS5 (a pBR322- 

10 derived vector which contains a proinsulin coding se- 
quence chemically synthesized according to the amino 
acid sequence reported by Bell et al . , Nature (1979) 
' 282 :525-527), which encodes proinsulin excepting the 
first 14 amino acids. These fragments were ligated 

15 together and introduced into the plasmid pPGAP, which 
had been previously digested with Ncol and Sai l and 
alkaline phosphatase treated. The resulting plasmid 
pSIl was digested with BamH I to provide an expression 
cassette which was cloned into plasmid pCl/1 to yield 

20 pYSIl. 

Cloning of alpha-l-anti trypsin . 

A cDNA library was made from lOpg of polyA 
RNA isolated from a part of a human liver. This library 
was prepared by oligo-dT priming of the first cDNA strand 

25 and self-priming of the second cDNA strand. The ds 
cDNA was size fractionated on a Sepharose CL4B column 
and those molecules greater than 300bp isolated. This 
fraction was treated with nuclease SI and tailed with 
dCTP, using terminal transferase- The tailed cDNA was 

30 annealed to pBR322 which had been digested with PstI 
and tailed with dCTP . Transformation of E. coli HB101 
yielded 60,000 colonies, where greater than 90% of the 
clones were recombinant. 

Two synthetic oligonucleotide probes were 

35 used to isolate the alpha- 1 - anti tryp s in (c^-AT) cDNA, 
the first probe corresponding to amino acid residues 
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344-350 near the C-terminus of the protein was used to 
probe 5,000 colonies and the second probe, corresponding 
to amino acid residues -23 to -17 (+1 being the first 
nucleotide of the first codon of the mature c^-AT) of 
5 the signal peptide, was used to probe 25,000 colonies. 
The probe sequences were taken from the partial nucleo- 
tide sequence described by Kurachi et al . , Froc. Natl. 
Acad. Sci. USA (1981) 78:6826; Leicht et al . , Nature 
(1982) 297:655). Approximately 3% of the colonies hy- 

10 bridized to the C-terminal probe and four hybridized to 
the N- terminal probe. The four N- terminal clones and 
12 C-terminal clones were isolated and subjected to 
' restriction analysis. From these, three overlapping 
clones which cover the entire cDNA were subjected to 

15. further study and were used to construct the full length 
cDNA clone. 

The entire sequence of a composite full length 
CDNA derived from the three plasmids is as follows: 
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34 Met Pro Sep Ser 

GGGGGGGCGGACGGTAATCGACA ATC CCG TCT TCT 

-10 -i 

i & l£8 28 li'c & L c?8 c!8 & 8E L c?8 % % #8 Si Sc? 2ft fS fcfl » 
! 88 ESS 88 Si » « » 88 iB It* ffi IS fS "cic H cif » 8S of! Sic 3 

— BamHI 

r, in j& m *ac as us is ees as m m si r ut « » % he ssg as 
,a m m Ki as * sa is ik sa % & as fs & & ss % & is &* 

,K ft? CC? S& cS IS C?8 Sci IS &B £ Kg ACT SiS OA? 8X Ikl cfS 88 ffi cfl 

ai >«n Phe Aan Leu Thr Glu He Pro Glu Ala Gin lie His Glu Glv Phe Gin Glu Leu Leu 
2*1 AAT TTC AAC CTC ACG GAG ATT CCG GAG OCT CAG ATC CAT GAA GGC TTC CAG GAA CTC CTC 

, ,„ „» Asp Gly(e) 

mi A n Thr Leu Aan Gin Pro Asp Sep Gin Leu Gin Leu ThP Thr Gly Asn Gly Leu Phe Leu 
301 CAT ACC CTC AAC CAG CCA GaS AGC CAG CTC CAG CTG ACC ACC GGC AAT GCfi CTG TTC CTC 

121 Sen Glu Gly Leu Lys Leu Val Asp Lys Phe Leu Glu Asp Val Lys Lys Leu Tyr His Sep 
361 AGO GAG GGC CTG AAG CTA CTC GAT AAG TTT TTG GAG GAT GTT AAA AAG TTG TAC CAC TCA 

iui n.i 41* Phe Thr Val Asn Phe Gly Asp ThP Glu Glu Ala Lys Lys Gin lie Asn Asp Typ 
It] GAA GCC TTC ACT CTC AAC TTC GG6 GAG ACC GAA CAG GCC AAG AAA CAG ATC AAC GAT TAC 

IK V S £8 kXS 8» IS M HI at? 8S ffi Sft BS SS B? SE S3 SS IS 

• »] ft* ft » » Kt i» ffi & TTC ft ffi »E ffi & ci8 iSi SS ft g! SS 

Ala(b) 

18! 88 SSE IES 88 8." 88 13 ft? Sic ft 88 88 SA IS ISJft 88 8. cc? SS 

221 Met Lys APg Leu Gly Met Phe Asn lie Gin His Cys Lys Lys Leu Ser Ser Tro Val Leu 
661 ATG AAG CCT TTA GGC ATG TTT AAC ATC CAG CAC TCT AAG AAG CTG TCC ACC TGG CTG CTG 

Asn(c) «.,„,. , 

i M 0 r i vs Tvr Leu Glv Asn Ala Thr Ala He Phe Phe Leu Pro Asp Glu Gly Lys Leu 
?21 CTG ATG AAA TAC CTG G& AAT GCC ACC CCC ATC TTC TTC CTG CCT GAT GAG GGG AAA CTA 

rin mq i on niu Asn Glu Leu ThP His Aso He lie ThP Lys Phe Leu Glu Asn Glu Aso 
781 WO CAC CTG GAA AAT GAA CTC ACC CAC GA? ATC ATC r ACC AAG TTC CTC GAA AAT GAA CAE 

-*qi c a w na car Leu His Leu Ppo Lvs Leu Sep lie ThP Gly ThP Typ Asp Leu Lys 

IS, AGA AcS TCT GCC AGC TTA CAT TTA CCC AAA CTG TCC ATT ACT GGA ACC TAT GAT CTG AAG 

301 Ser lli ( Leu } Gly Gin Leu Gly lie Thr Lys Val Phe Ser Asn Glv Ala Asp Leu Ser Glv 
901 AGC ATC CTG GOT CAA CTG GGC ATC ACT AAG CTC TTC AGC AAT GGG OCT GAC CTC TCC GGG 

•55 1 vai Thr r.tn Glu Ala Pro Leu Lys Leu Ser Lys Ala Val His Lys Ala Val Leu Thr He 
U\ GTC ACA GAG GAG GCA CCC CTG AAG CTC TCC A*G GCC CTG CAT AAG CCT CTG CTG ACC ATC 

•jni Aan Glu Lva Glv Thr Glu Ala Ala Gly Ala Met Phe Leu Glu Ala He Pro 
,021 GA? GAG AAA GgIj ACT GAA OCT OCT GGG GCC ATG TTT TTA GAG GCC ATA CCC 



Met Ser 
ATG TCT 



lie 
ATC 



i£i or.r. rii. wa» i w« Phe Asn Lvs Pro Phe Val Phe Leu Met lie Glu Gin Asn Thr Lys 
,6*81 CCC CCC GAG CTC aXS TTC AAC ffi CCC TTT CTC TTC TTA ATG ATT GAA CAA AAT ACC AAC 

— Aval 

,?$! % ccc & SB SK col kS ctg aat 8S IS 8! \R & acccrcrccacacAAC 



- Hinfl 

AAT CCC ACC CAA AAA TAG 

GGG TGG GTT TTT ATC AGCT 

Sail 



1201 CCCTCCCCTCCATCCCTGGCCCCCTCCCTGGATGACATTAAAGAAGGGTTGAGCTGGAAAAAAAAAAAAAAAAAAAAA/ 
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LEGEND 



Nucleotide and predicted amino acid sequences 
of a- -AT cDNA. The reactive center met-ser at 
positions 358-359 is boxed. Subscripts to 
5 amino acids in parentheses identify differences 

between the subject protein sequence and those 
derived from (a) protein sequencing (Carrell 
et al., 1982), (b) the cDNA of Woo et al . , 
[see Carrell et al . , 1982]), and (c) the cDNA 
10 of Bollen et al . , 1983 . The synthetic DNA 

molecules used in the construction of the 
BamH I to Sai l fragment encoding the mature 
protein are shown as are the cDNA restriction 
sites used in this construction. 



15 The above sequence was determined using the 

dideoxy sequencing method of Sanger et al., Proc. Natl. 
Acad. Sci. USA (1977) 74:5463, in the M13 vectors of 
Messing et al . , Nucleic Acids Res. (1981) 9:309. The 
differences at the nucleotide and amino acid level from 

20 the published cDNA sequences are shown. 

Construction of the full length clone for 
expression of yeast began with three fragments isolated 
from cDNA clones: 1) a 630bp BamHI-BstEII fragment; 2) 
a 450bp BstEII-Aval fragment; and 3) an 85bp Ayal-Hinfl 

25 fragment. A synthetic adapter was employed having the 
following sequence: 

A 2 TC 3 AC 3 A 5 TAG 

G-TG-T C ATCAGCT 



Approximately two pmoles of fragments 1 and 2 were li- 
30 gated together and after removal of the ligase, digested 
with BamHI and Ava l. Fragment 3 and the synthetic adap- 
ter were ligated and digested with Ava l and Sai l and 
the two resulting fragment mixtures were ligated followed 
by digestion with BamH I and Sail. Fragments migrating 
35 upon electrophoresis in the region of about 1000-1400bp 
were isolated and cloned by substitution into BamH I and 
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Sail digested and alkaline phosphatase treated pBR322. 
Thl resulting plasmid is referred to as pATi. 

Plasmid pPGAP was digested with Ncol, followed 
by blunt-ending, followed by Sail digestion and treat- 
me nt with alkaline phosphatase. The Ncol-Sall fragment 
was substituted with an approximately 1250bp blunt-ended 
(BamHI)-Sall fragment obtained from plasmid pATi, by 
BamHI digestion, blunt ending, and Sail digestion. 
This was inserted into the pPGAP vector to produce the 
plasmid pGAPATi , a 6.6kb plasmid, which was digested 
with Ncol and BamHI and a 2.3kb KcoI-BamHI fragment 
obtained having the « X -AT gene and the GAPDH terminator 
• and approximately 400bp BamHI-Ncol fragment obtained 
having the GAPDH promoter. These fragments were ligated 
> together and inserted into the BamHI site of pCl/1. 

The plasmids pCl/lGAPATi8 and P Cl/lGAPATi9 were obtained 
with the orientation of expression clockwise in the^ 
former and counterclockwise in the latter, with amp. 
being in the counterclockwise direction. These plasmids 
0 were transformed in S. cerevisiae AB103 (A.T.C.C. No. 
20658, deposited January 5, 1983) by standard methods, 
selecting for leucine prototrophy and grown as described 
above. Yeast extracts were prepared by lysis with glass 
beads and the c^-AT activity determined by inhibition 
25 of human leukocyte elastase. 

Assays contained in lml:0. 1-0.2 human leukocyte 
elastase (HLE); .O.lmM M eO-Suc-Ala-Ala-Pro-7al- E -nitro- 
anilide (Beatty et al . , .t. Biol. Chem. (1980) 255:3931); 
50mM Tris, P H 8, 0.5M NaCi, and the indicated amounts 
30 of yeast extract or human er^-AT. Assays were initiated 
by the addition of elastase, incubated at 28«C for ISmin, 
terminated by the addition of lOOpl of 8N acetic acid 
and the absorbance at 410nm determined. Typical results 
are shown in the following Table 3. 
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The above data demonstrate that plasmids having the 
orientation of the expression cassette in the counter- 
clockwise orientation, the promoter proximal to the 
long sequence of pBR322, make 10-20 times more cr^-AT 
5 than the same cassette in the other orientation. 

Yeast strain AB110. 

Yeast strain 2150-2-3 was crossed with a yeast 
strain AB103 transformant containing pCl/lGAPATi9 . The 
diploids were sporulated and the tetrads dissected. 

10 Strains were maintained on leucine selective plates in 
order to ensure maintenance of the plasmid, since the 
* parents are autotrophs. A series of colonies were 
screened for their genotype with respect to a number of 
markers. The most vigorous strains were selected and 

15 cultures grown on leucine selective media. The best 

strain was designated AB110 (pCl/lGAPATi9 ) , gave 6-7.5% 
of the total cell protein as cr^-AT as shown in the above 
Table 3. The strain AB110 has the following genotype: 
Mater, ura3-52, leu2-04 or both leu2-3 and leu2-112, 

20 pep4-3, his4-580 (cir°). 

Phosphate induction . 

Plasmid pPGT80 was digested with BamH I , the 
ends blunt-ended, followed by digestion with Xba l and 
the 500bp fragment containing the GAFDH promoter and 

25 5 '-end of the sAg gene isolated. 

The PH05 gene was isolated from a yeast genomic 
library employing an oligonucleotide probe 
5 1 -GGCACTCACACGTGGGACTAG-3 1 derived from the published 
partial sequence (Meyhack et al . , The EMBO Journal (1982) 

30 1:675-680). A subfragment of this clone containing 
550bp of the 5' -untranslated region and approximately 
80 bp of coding sequence was subcloned as a BamH I - Sai l 
substitution in pBR322 to provide pPHOS. This fragment 
has the sequence 5 1 -ATGTTTAAA-3 % , encoding the first 

35 three amino acids, the second and third coddns 
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specifying an Ahalll site. The plasmid pHBS6 was digested 
with Ncol, blunt-ended, followed by digestion with BamH I 
and treatment with alkaline phosphatase. The PH05 pro- 
moter region was obtained by digesting the pPH05 plasmid 
5 with Aha lll, resecting the resulting fragment with Bal3 1 
for a short time, followed by digestion with BamH I and 
isolation of a 500-550bp BamH I blunt-ended fragment. 
This fragment was employed for substitution of the 
Nco l - BamH I fragment: from pBBS6 and was screened for 

10 regeneration of the Nco l restriction site to provide 
plasmid pHBS6PH05/l. 

Plasmid pHBS6PH05/l was digested with BstE II 
' which cleaves at position -175 in the PH05 promoter. 
This molecule was blunt-ended, digested with Sail and 

15 the 650bp fragment having the 5 1 -portion of the promoter 
domain, containing 275bp of pBR322 and 375bp of the 
PH05 promoter region isolated. This fragment was li- 
gated with the blunt-ended (BamHI) -Xbal fragment ob- 
tained from digestion of pPGT80 with BamH I, blunt end- 

20 ing, followed by Xbal digestion. After digesting the 

ligated fragment with Sai l and Xba l, the resulting frag- 
ment was then substituted into pPGT16-3 which had been 
digested with Sai l and Xba l and treated with alkaline 
phosphatase. The resulting plasmid pPHO5PGT80 had a 

25 cassette comprising the PH05 regulatory region, the 
GAPDH promoter, the sAg gene and the ADH terminator. 
This cassette was excised from the plasmid by BamH I 
digestion, whereby a 1.8kb BamH I - BamH I fragment was gel 
isolated and ligated into the BamH I site of BamH I di- 

30 gested and alkaline phosphatase treated pCIA to provide 
plasmids PH05GAP1 and PH05GAP2 where the PH05 was distal 
and proximal to the long pBR322 sequence, respectively. 

The two plasmids were transformed into yeast 
strain 2150-2-3 as described above and grown in rich 

35 media as described above for 8 to 10 generations in 
either high (7mM) or low (0.2raM) phosphate. Samples 
were harvested in late log phase and HBsAg determined 
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as described previously. The results are shown below 
in Table 4. 



Table 4: Regulation of HBsAg Production in Yeast using a 
Hybrid PHOS/GAPDH Promoter. 


High Phosphate Low Phosphate 
Construction C7mM) (0 . 2mM) 

(sAg Mg/mg protein) 


Induction 
low / high 


PH05GAP-1 0.08 0.95 


12.0 


PH05GAP-2 0.27 0.40 


1.5 


From the above results, it is 


evident that 



effective regulation with phosphate is obtained, with 
one orientation being superior to the other. 

It is evident from the above results, that 
highly efficient expression can be obtained, either 
constitutive or regulated, by providing for truncated 
promoter regions of yeast glycolytic enzyme gene promo- 
ters, employing the 3 1 domain proximal to the coding 
region of the gene in conjunction with a 5' -portion or 
second domain of the promoter region of a yeast gene 
subject to inducible regulation by a nutrient, e.g., 
carbon source or phosphate, temperature, or other exter- 
nally controllable source or condition. Alternatively, 
the second domain may be replaced by prokaryotic se- 
quences of at least about lkb or greater, which provide 
for constitutive enhancement in the absence of the sec- 
ond domain of the promoter region. Thus, a wide variety 
of genes exogenous to yeast may be expressed in high 
yield in high percentages of the total protein of the 
yeast host. 



41 0164556 

Although the foregoing invention has been 
described in some detail by way of illustration and 
example for purposes of clarity of understanding, it 
will be obvious that certain changes and modifications 
may be practiced within the scope of the appended claims . 
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WHAT IS CLAIMED IS: 



1. In a method for obtaining expression of 
a foreign structural gene in a yeast host employing 
hybrid constructions involving transcriptional and 
5 translational regulatory signals recognized by said 
host joined to a foreign gene for transcriptional and 
translational regulation of expression in said host, 
wherein said construction is introduced into said host 
and said foreign gene is expressed to produce the en- 

10 coded polypeptide, 

the improvement which comprises: 
employing for transcriptional regulation, a 
DNA sequence having first and second domains, 

said first domain being proximal to said struc- 
15 tural gene, being of less than about 600bp and having a 
sequence which includes the RNA polymerase binding sxte 
and transcription initiation site of a yeast glycolytic 

enzyme gene; and 

said second domain distal to said structural 
20 gene providing for enhanced efficiency of transcription 
and being the regulatory region of a regulated yeast 
gene other than the natural gene of said first domain 
or a prokaryotic sequence of at least about 500bp. 

2. A method according to Claim 1, wherein 
25 said yeast glycolytic enzyme gene of said first domain 

is glyceraldehyde-3-phosphate dehydrogenase. 

3. A method according to Claim 2, wherein 
said regulatory region is the GAL1 or GAL10 regulatory 
region. 

30 4. A method according to Claim 2, wherein 

said regulatory region is the PH05 regulatory region. 
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5. A method according to Claim 1, wherein 
said yeast glycolytic enzyme of said first domain is 
pyruvate kinase. 

6. A method according to any of Claims 1, 
2, 3, 4, or 5, wherein said polypeptide is Hepatitis B 
surface antigen, superoxide dismutase, or c^-anti trypsin. 

7. In a method for obtaining expression of 
a foreign structural gene in a yeast host employing a 
hybrid construction involving transcriptional and trans- 
lational regulatory signals recognized by said host 
joined to a foreign gene for transcriptional and trans - 
lational regulation of expression in said host, wherein 
said construction is introduced into said host and said 
foreign gene is expressed to produce the encoded poly- 
peptide, the improvement which comprises employing for 
transcriptional regulation, a DNA sequence having first 
and second domains, 

said first domain being proximal to said struc- 
tural gene, being of about 200-500bp, being derived 
from the 5 '-untranslated region of the yeast glyceralde- 
hyde-3 -phosphate dehydrogenase or pyruvate kinase gene 
and extending downstream to at least nucleotide -10; 
and 

said second domain distal to said structural 
gene, being a prokaryotic sequence of at least about 
lOOObp, wherein said yeast cells containing said con- 
struct are grown to high density; and 

isolating said polypeptide. 

8. In a method for obtaining expression of 
a foreign structural gene in a yeast host employing 
hybrid constructions involving transcriptional and trans- 
lational regulatory signals recognized by said host 
joined to a foreign gene for transcriptional and trans - 
lational regulation of expression in said host, wherein 
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said construction is introduced into said host and said 
foreign gene is expressed to produce the encoded poly- 
peptide, the improvement which comprises employing for 
transcriptional regulation, a DNA sequence having first 
and second domains, 

said first domain being proximal to said 
structural gene, being of about 200-500bp and including 
the RNA polymerase binding site and transcription ini- 
tiation site of the yeast glyceraldehyde-3 -phosphate 
dehydrogenase or pyruvate kinase gene; and 

said second domain distal to said structural 
gene having a sequence providing for regulation of the 
' acid phosphatase FH05 gene or the GALl or GAL 10 gene, 

growing said host containing said construct 
in a nutrient medium to high density of said host under 
non-inducing conditions; 

changing the conditions to inducing conditions 
to induce expression of said structural gene; and 

isolating the polypeptide. 

9. A DNA construct comprising in the down- 
stream direction a yeast promoter region having first 
and second domains, an insertion sequence of from about 
4-100bp having at least one unique restriction site for 
the construct, and a termination region, wherein said 
two domains are a distal domain of at least about lOObp 
of a DNA sequence providing for regulation of a yeast 
gene or a prokaryotic sequence of at least about 500bp 
and a proximal domain which includes the KNA polymerase 
binding site and the transcription initiation site of a 
yeast glycolytic enzyme gene. 

10 • A DNA construct for expression of a 
structural gene foreign to yeast, comprising in the 
downstream direction: 
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a promoter region comprising two domains and 
providing for transcriptional and trans lational signals 
for initiation in yeast; 

a structural gene under regulatory control of 
5 said promoter region; and 

a termination region providing for transcrip- 
tional and translational termination in yeast; 

wherein the distal domain comprises a sequence 
of at least about lOObp including a sequence for regula- 
10 tion of a regulatable yeast gene or a prokaryotic se- 
quence of at least about 500bp and the proximal domain 
includes an UNA polymerase binding site and transcrip- 
tion initiation site of a yeast glycolytic enzyme gene. 

11. A construct according to Claim 10, where- 
15 in said gene is c^-anti trypsin, superoxidase dismutase 
or Hepatitis B surface antigen. 



12. A modified yeast host containing as an 
extrachromosomal element a plasmid vector having a rep- 
lication system, at least one marker, and a cassette 

20 comprising in the downstream direction of transcription, 
a promoter region functional in yeast for transcriptional 
and translational initiation, a structural gene foreign 
to said yeast under the transcriptional and translational 
control of said promoter region, and a terminator region 

25 functional in said yeast, wherein said promoter region 
has two domains, a distal domain distant from said gene 
of at least about lOObp comprising a sequence providing 
for regulation of a regulated yeast gene or a prokaryo- 
tic sequence of at least about 500bp, and a proximal 

30 domain proximal to said gene, comprising the RNA poly- 
merase binding site and transcription initiation site 
of a yeast glycolytic enzyme gene and of about 200 to 
600bp and extending to at least the -20nt of said yeast 
glycolytic enzyme gene. 
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13. A yeast cell according to Claim 12, where- 
in the yeast glycolytic enzyme gene for said proximal 
domain is glyceraldehyde-3-phosphate dehydrogenase or 
pyruvate kinase. 

5 14. A method for producing a foreign protein 

which comprises s 

growing cells according to Claim 12, wherein 
said foreign protein is produced. 

15. A method according to Claim 14, wherein 
10 said foreign protein is a ^antitrypsin, human superoxide 
• dismutase or Hepatitis B surface antigen. 



16. A method according to Claim 2, wherein 
said regulatory region is the ADHII regulatory region. 
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Description 

Field of the Invention 

s With the advent of hybrid DNA technology, production of mammalian proteins in microorganisms 
became a reality. For the first time, mammalian proteins could be synthesized in a unicellular microorgan- 
ism by introduction of a gene encoding for the mammalian protein under the transcriptional and translational 
control of regulatory sequences recognized by the microorganism host. The introduction of these foreign 
constructions into a microorganism host resulted in competition between the regulatory signals of the 

to construct and the regulatory signal endogenous to the host for the host systems involved with expression. 
The structural gene of interest is usually directed to a product which is nonproductive and may be injurious 
to the host. Therefore, host cells which can turn off the foreign gene can effectively dominate modified host 
cells. 

Substantial progress has been made in isolating sequences concerned with transcriptional and tran- 
75 slational regulation for protein expression. But frequently flanking sequences, as well as distant sequences, 
may also affect the efficiency and regulation of the expression of the protein. Therefore, as one manipulates 
these various sequences, removing them from their native environment, and joining them to unnatural 
sequences, that is sequences other than the wild-type sequence, one can never be certain as to the result. 
In order to enhance the economies of producing proteins in microorganisms, there have been 
20 substantial efforts directed to improving the efficiency of transcription and translation, maximizing the 
proportion of total protein directed to production of the desired product, enhancing the viability of the 
modified host, as well as improving the efficiency with which the modified host may be obtained. 

Description of the Prior Art 

25 

Guarente et al., Proc. Natl. Acad. Sci. USA (1982) 79:7410-7414, describe a hybrid promoter region 
employing the GAL4 regulatory region. Guarente and Ptashne, ibid. (1981) 785199-2203, report the 
presence of two domains in a yeast promoter, with a region upstream from the TATA sequence providing 
an activation site. Kramer et al., ibid. (1984) 81367-370. describe the regulated expression of a human 
30 interferon gene in yeast employing the yeast acid phosphatase promoter, where expression is induced by 
phosphate or a temperature shift. Tekamp-Olson et ah, Cold Spring Harbor Meeting, Molecular Biology of 
Yeast, 1983, describe the absence of deleterious effects on yeast viability when employing "short" 
promoters, as distinct from the presence of such effects when employing an extended promoter region or 
"long" promoters. 

35 EP-A 72 318 concerns the expression of the Hepatitis B surface antigen in yeast using the alcohol 
dehydrogenase promoter, devoid of its upstream regulatory region. Nature, Vol. 300 (1982), p. 724 - 728, 
discloses the promoter from one yeast gene used with a different yeast structural gene but does not 
disclose a hybrid activation site construct. EP-A 73657 discusses the possibility of constructing yeast hybrid 
promoters consisting of the regulator of one gene joined to the promoter of a second, highly expressed, 

40 gene in order to control expression. 

Summary of the invention 

The invention is directed to a method for obtaining expression of a foreign structural gene in a yeast 
45 host employing hybrid constructions involving transcriptional and translational regulatory signals recognized 
by said host joined to a foreign gene for transcriptional and translational regulation of expression in said 
host, wherein said construction is introduced into said host and said foreign gene is expressed to produce 
the encoded polypeptide, comprising: 

employing for transcriptional regulation, a DNA sequence having first and second domains, 
so said first domain being proximal to said structural gene, being of less than about 600bp and having a 
sequence which includes the RNA polymerase binding site and transcription initiation site of a yeast 
glyceraJdehyde-3-phosphate dehydrogenase gene; 

said second domain being proximal to said first domain and distal to said structural gene, providing for 
enhanced efficiency of transcription and being the regulatory region of a regulated yeast gene other than 
55 the natural gene of said first domain; and 

said regulatory region being the PH05 regulatory region. 

According to a preferred embodiment of the invention, said polypeptide is Hepatitis B surface antigen, 
superoxide dismutase or ai -antitrypsin, and said regulatory region is the ADHII regulatory region. 
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According to a further embodiment, the invention is directed to a method for obtaining expression of a 
foreign polypeptide in a yeast host employing a hybrid construction involving transcriptional and tran- 
slational regulatory signals recognized by said host joined to a foreign gene for transcriptional and 
translational regulation of expression in said host, wherein said construction is introduced into said host and 
s said foreign gene is expressed to produce the encoded polypeptide, comprising employing for transcrip- 
tional regulation, a DNA sequence having first and second domains, 

said first domain being proximal to said structural gene, being of about 200-500bp, being derived from the 
5'-untranslated region of the yeast glyceraldehyde-3-phosphate dehydrogenase or pyruvate kinase gene 
and extending downstream to at least nucleotide -10; and 
ro said second domain being distal to said first domain and proximal to said structural gene, being a 
prokaryotic sequence of at least about 1000bp, wherein said yeast cells containing said construct are grown 
to high density; and 
isolating said polypeptide. 

According to a further preferred embodiment, the invention is concerned with method for preparing a 
75 polypeptide in a yeast host, where the polypeptide is heterologous to the yeast and may be produced in 
low percentage amounts of total protein, the improvement which comprises: 

joining an open reading frame DNA sequence coding for said polypeptide with a second open reading 
frame DNA sequence coding for heterologous human superoxide dismutase to form a fusion polypeptide; 
introducing the sequence coding for said fusion polypeptide under conditions for expression in said yeast 
20 which includes an inducible transcriptional initiation regulatory region that consists essentially of, reading in 
a 5' to 3' direction, a yeast glycolytic enzyme promoter region and adjacent alcohol dehydrogenase-2 
control region, whereby said fusion polypeptide is expressed; 
isolating said fusion polypeptide to provide the polypeptide in high yield. 

25 BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a diagrammatic view of plasmids GAP1-6; 
Figure 2 is a diagrammatic view of plasmids Pyk1-6; and 

Figure 3 indicates the DNA linker sequence and a flow diagram showing its use in a construct for hSOD. 

30 

DESCRIPTION OF THE SPECIFIC EMBODIMENTS 

Novel methods are provided employing fragments for enhanced expression of structural genes in a 
yeast host. Constructs employing a hybrid promoter region provide for enhanced efficiencies of transforma- 
35 tion and greatly improved viability of the yeast host as contrasted with those employing a wild-type yeast 
promoter. Concomitant with the improved viability is increased expression of a structural gene, in 
comparison with the truncated promoter region, and, therefore, greatly enhanced overall yields of expres- 
sion products. 

For the purposes of the subject invention, the "promoter region" is divided into two domains: (1) the 

40 structural gene proximal region, which includes the transcription initiation site, the "TATA" sequence 
capping sequence, as appropriate, and an RNA polymerase binding sequence, which sequence intends a 
sequence which includes nucleotides upstream from the initiation site for directing the initiation of synthesis 
of the messenger RNA; and (2) a distal region which provides for regulated or constitutive expression, with 
enhanced expression as compared to the first domain linked to non-functional yeast DNA. 

45 The hybrid promoters of the subject invention employ the RNA polymerase binding region of a yeast 
glycolytic enzyme promoter and a region upstream from said polymerase binding region, which is different 
from the wild-type upstream region of the RNA polymerase binding region and provides for enhanced 
efficiencies of transcription. This distal region will be derived from either a sequence, usually a yeast 
sequence, involved in regulation of transcription, or a prokaryotic sequence which provides for enhanced 

so constitutive expression of the desired gene. 

Conveniently, cassettes or constructs can be prepared which provide for one or more restriction sites 
intermediate the promoter region and a related terminator region where the structural gene may be inserted, 
so as to be under the transcriptional control of the hybrid promoter region. By having one or more 
restriction sites, one can provide for ease of insertion of the structural gene intermediate the transcription 

55 initiation and termination regions. The cassettes which can be prepared comprising the transcriptional 
initiation and termination region, having appropriate restriction sites for structural gene insertion can be 
cloned in prokaryotic vectors, so that after insertion of the structural gene, the resulting cassette, including 
the structural gene, may be cloned, isolated and purified, before introduction into a yeast vector. 
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The cassette, will for the most part, have the following formula: 
-(P.R.(2)-P.R.(1 ))-R.S.-T.R.- 
5 wherein: 

P.R.(1) is the promoter region proximal to the structural gene and having the transcription initiation site, 
the RNA polymerase binding site, and including the TATA box, the CAAT sequence, as well as translational 
regulatory signals, e.g., capping sequence, as appropriate; 

P.R.(2) is the promoter region joined to the 5*-end of P.R.(1) associated with enhancing the efficiency of 
io transcription of the RNA polymerase binding region; 

R.S. is a sequence having one or more restriction recognition sites, preferably at least two restriction 
recognition sites, where the sites may result upon restriction into blunt ends or overhangs; 

T.R. intends the termination region, which will include the terminator, which may be a stem and loop 
structure, and which may be associated with one or more stop codons, a polyadenylation signal sequence, 
75. if any, as well as any other transcriptional and translational termination sequences. 

P.R.(1) will generally be at least about 150bp, more usually at least about 200bp, usually not more than 
about 600bp, more usually not more than about 500bp, generally not more than about 450bp and preferably 
less than about 400bp; the sequence will extend in the downstream direction of transcription to about bp 
+ 3, more usually bp -1 and may extend only to bp -20, more usually to bp -10 (the numbering intends that 
20 +1 is the first bp of the initiation codon with which the promoter region is associated in the wild-type host, 
while -1 is the immediately upstream bp and the integers increase in the direction of transcription; 

P.R.(1) will be derived from a strong yeast promoter, normally a glycolytic enzyme promoter, such as 
glyceraldehyde-3-phosphate dehydrogenase, pyruvate kinase, alcohol dehydrogenase, 
phosphoglucoisomerase, triose phosphate isomerase, phosphofructokinase, etc.; 
25 P.R.(2) will provide for an enhancing function of transcription, which enhancing function may provide for 
constitutive or regulated transcription; regulators will be derived from regions associated with regulation of 
yeast genes, other than the natural or wild-type gene associated with the first domain in the wild-type or 
natural host, such as acid phosphatase (PH05) or alcohol dehydrogenase II (ADH I and II), etc. For yeast 
regulatory sequences, the domain will usually be at least about 100bp, more usually at least about 200bp, 
30 for convenience generally not exceeding about 3kbp, usually not exceeding about 1kbp, desirably not 
exceeding about 600bp. The regulatory region will generally begin at least about 200bp from the initiation 
codon, usually at least about 300bp and may begin at 400bp or farther upstream from the initiation codon. 

Regulation can be as a result of a change in the chemical or physical environment of the host, such as 
a change in carbon source, e.g., glucose to galactose or vice versa; a change in concentration of a nutrient, 
35 e.g., an inorganic nutrient such as a phosphate; or a change in temperature, e.g., 25 *C to 35 - C. 
Constitutive transcription can be achieved employing prokaryotic sequences of at least about 1kbp for 
convenience, generally not exceeding about 5kbp; conveniently, the prokaryotic sequence can be obtained 
from the vector in which the cassette is cloned, illustrative vectors including pBR322, lambda. Charon 4A, 
PACYC184, pUCS.etc. 

40 R.S. will generally be at least 4bp, more usually at least 6bp, and may be 100bp or more, more usually 
being not more than about 60bp and may include one or more, usually not more than about 10 restriction 
sites, where such restriction sites may be illustrated by EcoRI, BamHI, Sail, Hindlll, Alul, Aval, Tagl, Hpa l, 
etc., having at least one unique restriction site for the construct sequences. 

T.R. is the termination region which will include the necessary transcriptional and translational signals 
45 for termination, such as the polyadenylation site, etc.; 

T.R. will generally be at least about 1O0bp, more usually at least 150bp, and generally less than about 
1kbp, usually less than about 600kbp; the termination region may be derived from any convenient yeast 
sequence, so long as the terminator balances the promoter, conveniently being derived from a glycolytic 
enzyme terminator, where the terminator may be associated with the same or different enzyme with which 
so the promoter is associated. 

Where a cassette is cloned in a bacterial vector, the construction will have the following formula: 



55 



-(P.R. (2)-P.R. (1) )-R.S.-T.R.-Rep(B)-(M(B) ) - 

• ( _, Li 
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wherein all the symbols have been defined previously, except for: 

Rep (B), which intends a replicon or replication system recognized by a prokaryotic host and may be 
derived from a plasmid or phage, such as ColE1, and R plasmid, e.g., pRK290, lambda, e.g., Charon 4A, 
Xdv, etc.; 

5 M is a marker which provides for selection of hosts containing the construction, where (B) intends a 
prokaryotic, e.g., bacterial, host and a intends an integer of from 0 to 3, usually 1 to 2, although additional 
markers may be present, where the marker allows for selection of the host containing the construct as well 
as providing for selective pressure on maintaining hosts having the construct; the markers include biocide 
resistance, such as antibiotic resistance, toxin resistance and heavy metal resistance; providing prototrophy 
70 to an auxotrophic host; providing immunity; and the like; 

the markers may provide for complementation of an auxotrophic host, e.g., his", ura". trp~,leu" 
genotype, resulting in prototrophy; resistance to metals, such as cup + genotype; resistance to antibiotics, 
such as amp r , tc r , cam r , stK, tuf genotype, etc.; 

b is 0 or 1 , intending that the construction is either linear or circular, usually circular. 
75 The above construct can be used for insertion of a wide variety of structural genes, both prokaryotic 
and eukaryotic, both naturally occurring and synthetic, where the genes may include signal leaders for 
secretion, and the like. The genes may express enzymes, hormones, proteins from pathogens for vaccines, 
structural proteins, lymphokines, membrane surface proteins, immunoglobulins, blood proteins, or the like. 
The particular structural gene which is inserted is not critical to this invention and any polypeptide or protein 
20 of interest may be prepared employing the constructions of the subject invention. The structural genes will 
usually be foreign to the yeast host, where foreign intends different from wild-type yeast structural genes 
and from a source that does not normally exchange genetic information with yeast. 

Usually, the structural gene will be at least about 36bp, and not more than about 20kbp, usually not 
more than about 3000bp, usually not more than about 1500bp. Included in the structural gene may be non- 
25 coding flanking regions, the S'-flanking region normally being quite short, usually less than about 30bp, 
while the 3 f -flanking region may be extended, usually not exceeding about 500bp. Thus, the structural gene 
fragment will usually include the translational stop codons for proper termination of amino acid chain 
extension. 

When the structural gene has been inserted into the cassette which is joined to a yeast replication 
30 system, normally including one or more markers recognized by yeast, the resulting construct will have the 
following formula: 



-<P.R. (2)-P.R. (l))-gene-T.R,-Rep(Y)-(M(Y)) - 

35 | a 



wherein all of the symbols have been defined previously except for: 

gene, which intends the structural gene, having its initiation codon and stop codons as appropriate; and 

40 Y, which intends that the symbol is related to yeast. 

Convenient yeast replication systems include the 2um plasmid replication system, combination of 
CEN3 and ARS1 or ARS3, or the like. The replication systems may be high or low copy number, depending 
on the effect of the construct on the viability of the host. While the indicated replication systems are those 
which have found common employment, any replication system useful in yeast may be employed which 

45 provides for efficient replication and maintenance. Often the structural gene will be inserted into an 
appropriate shuttle vector capable of replication and selection in either a yeast or bacterial host, where the 
resulting construction will have the following formula: 

50 -(P.R.(2)-P.R.(l))-gene-T.R.-Rep(Y)-(M(y)) -Rep(B)-(M(B)) - 
I a 



where all symbols have been defined previously. Also, it is, of course, understood that the cassette without 
55 an inserted structural gene but containing the restriction enzyme recognition sequence, R.S., may be 
propagated in yeast or contained within a shuttle vector, where the construction will have the following 
respective formulae: 
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-(P.R.(2)-P.R.(l))-R.S.-T.R.-Rep(Y)-(M(Y)) - 

CL 



s 



-(P.R.(2)-P.R.(1))- R.S.-T.R.-Rep(Y)-(M(Y)) -Rep(B)-(M(B)) - 

a a 



TO 



where all symbols have been defined previously. 

The various fragments which form the cassette and final constructions may be joined together in 
accordance with conventional ways. In many cases, genes have been isolated and restriction mapped, as 

T5 well as sequenced. To that extent, one can select the sequence of interest by restriction of the gene, 
employing further manipulation as necessary such as resection with Bal31, in vitro mutagenesis, primer 
repair, or the like, to provide a fragment of a desired size, including the desired sequence, and having the 
appropriate termini. Linkers and adapters can be used for joining sequences, as well as replacing lost 
sequences, where the restriction site is internal to the region of interest. The various fragments which are 

20 isolated, may be purified by electrophoresis, electroeluted, ligated to other sequences, cloned, reisolated 
and further manipulated. 

The use of regulatory sequences for controlling transcription of the structural gene of interest allows for 
growing the host cells to high density with no or low levels of expression of the structural gene, and then 
inducing expression by changing the environmental conditions, e.g., nutrient, temperature, etc. 

25 For example, for PH05 regulation one could grow the cells at high phosphate, about 1 to 10mM, and 
then decrease the phosphate concentration to about 0.1 to 0.5mM. For temperature sensitivity, one could 
grow the cells at 25* to 37 *C and then change the temperature as appropriate by about 5* to 20 *C. The 
host cells would have the regulatory system associated with the regulatory region employed. 

Various techniques will be exemplified in the Experimental section of this application, which techniques 

30 can be used as paradigmatic for constructions employing fragments from sources other than those 
exemplified. Of particular interest, as evidenced by the Experimental section, will be the use of the 
glyceraldehyde-3-phosphate dehydrogenase promoter region for the RNA polymerase binding site in 
conjunction with regulator sequences, such as those associated with PH05 or ADHII. The PH05 and ADHII 
sequences refer to regions associated with the PH05 and ADHII genes which provide for transcriptional 

35 regulation of the PH05 and ADHII gene, respectively. 

The following examples are offered by way of illustration and not by way of limitation. 

EXPERIMENTAL 

40 All DNA manipulations were done according to standard procedures. See Molecular Cloning , T. Maniatis 
et al., Cold Spring Harbor Lab., 1982. Enzymes used in cloning were utilized as per the manufacturer's 
specifications. Enzymes were obtained either from New England Biolabs or Bethesda Research Laborato- 
ries. Procedures with these enzymes employed the supplier's directions. Yeast were transformed and grown 
using a variety of media including selective medium (yeast nitrogen base without leucine); YEPD medium, 

45 containing 1% (w/v) yeast extract, 2% (w/v) peptone and 2% (w/v) glucose, and others as appropriate 
and/or detailed below. In the case of plating medium contained 2% (w/v) agar and for transformation 3% top 
agar. Hepatitis B surface antigen was determined after lysis of yeast by glass bead agitation and 
clarification by centrifugation, using the Ausriall assay (Abbott Laboratories). Protein is determined by the 
Coomassie dye binding method. 



Construction of GAL regulator containing plasmids . 

Plasmid pLGSDS is prepared as described in Guarente et a}., (1982) supra . The plasmid was 
manipulated as follows: After restriction with Xho l, the overhangs were filled in with the Klenow fragment of 
55 DNA polymerase I ("Klenow fragment"), ligated with EcoRI linkers (GGAATTCC) and then completely 
digested with EotRI and Sau3A to provide a 370bp fragment which was isolated by gel electrophoresis and 
included the intergenic sequence between GAL1 and GAL10 genes of yeast, and provides for the GAL4 
regulation sequence of the GAL1 and GAL10 genes. 
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This fragment was inserted into pBR322 which had been completely digested with EcoRI and Bam HI, 
followed by treatment with alkaline phosphatase to prevent oligomerization. The resulting plasmid pBRGAL4 
was treated in two different ways. 

In the first procedure p8RGAL4 was completely digested with Sau3A. the overhangs filled in with the 
Klenow fragment, and the resulting blunt-ended fragment ligated with Sail linkers (CGTCGACG), followed by 
digestion with Sail and Xhol. The resulting 370bp fragment was isolated by gel electrophoresis. This 
fragment has the original 370bp yeast GAL4 regulator sequence with Xhol and Sail termini. 

The second fragment was obtained by complete digestion of pBRGAL4 with Xhol and Sail to provide a 
Xhol-Sall fragment which included the 370bp yeast GAL4 regulator sequence as well as about 280bp of 
pBR322 t the GAL4 sequence extending from Sau3A to Sail. 

The two fragments were then cloned in the plasmid plots, plots was prepared by inserting the 40bp 
polylinker fragment of the following sequence 



EcoRI BamHI Ball I Xbal Hindi I I 

i i i r r 

5 ' AATTCCCGGGGATCCGTCGACCTGCAGATCTCTAGAAGCTTCAG 
3 1 GGGCCCCTAGGCAGCTGGACGTCTAGAGATCTTCGAAGTC 

III I 
Smal Sail PstI PvuII 



into pBR322 as an EcoRI- PvuII substitution followed by insertion of the trp-lac promoter (Russell and 
Bennett. Gene (1982) 20:231-245) into the PvuII site with transcription oriented toward the polylinker 
sequence, plots was completely digested with Sail, followed by treatment with alkaline phosphatase and the 
370bp and 650bp fragments independently inserted into plots to provide plasmids plot5GAL4/370 and 
plot5GAL4/650, respectively. Each of the plasmids was then completely digested with Bam HI and Sail to 
reproduce the individual fragments extended by 6bp of the polylinker fragment. These fragments werelhen 
ligated into pC1/1, which had been completely digested with BamHI and Sail followed by treatment with 
alkaline phosphatase to prevent recircularization. Plasmid pC1/1 is a derivative of pJDB219 (Beggs, Nature - 
(1978) 275:104) in which the region corresponding to bacterial plasmid pMB9 in pJDB219 has been 
replaced by pBR322 in pC1/1. The resulting plasmids were designated pC1/1GAL4/370 and 
PC1/1GAL4/650, respectively. The BamHI-Sall fragment is located in the pBR322 portion of the vector 
pC1/1. 

The next construction develops a hybrid promoter for expression of the Hepatitis B surface antigen 
(HBsAg or sAg), employing the RNA polymerase binding region of GAPDH. The previously prepared 
plasmid pHBS56/16-3, a yeast shuttle vector containing the alcohol dehydrogenase 1 (ADH1) promoter, the 
HBsAg gene and ADH terminator as a Sghl fragment, was digested with Sghl and the ends modified with 
Bam linkers. The Bam linkers have the sequence CGGATCCG. 

PHBS56/16-3 was prepared as follows: A Taql-Hpal fragment obtained from the HBsAg coding region 
which included 26bp of the pre-sAg region. 681 bp of the sAg region and 128bp of the 3'-untranslated 
region, was linked with E<x>RI linkers and cloned at the EcoRI site in pBR322. The EcoRI linkers have the 
sequence GGAATTCC. The plasmid pHBS5 was thus obtained. 

After digesting pHBSS with EcoRI, the digest was resected with Bal31 and religated with EcoRI linkers 
(GGAATTCC). After digestion with EcoRI the material of about 800bp was isolated from a polyacrylamide 
gel. This isolate was then recloned into pBR322 which had been digested with EodRI and treated with 
alkaline phosphatase. Where the resection was to the sequence CATGG, which included the methionine 
codon, the EcoRI linker created an Nwl site. The plasmids were screened for the presence of an Ncol site 
and one of the plasmids chosen for further manipulation. This plasmid, designated pHBS5-3, was restricted 
with Ea>RI, the EcoRI fragment made blunt-ended with Klenow fragment and dNTPs, and the blunt-ended 
fragment was then restricted with Xbal to provide an about 100bp fragment having an Xbal overhang and 
blunt end at the former EcoRI site. 

pHBSS was then digested with Clal, made blunt-ended with the Klenow fragment and dNTPs, digested 
with Xbal, followed by alkaline phosphatase treatment The 100bp fragment was then inserted into the 
vector to provide the plasmid pHBS6. Upon sequencing of the blunt-ended ligation site, it was found that an 
adenosine had been lost, so as to lose the EcoRI site, where the sequence was now ATCGATTCCCATGG. 
The Clal and Nwl sites were retained. The loss of the A resulted in pHBS6 having a single EooRI site. 

pHBS5-3 was digested with EcoRI and the resulting Eco RI fragment having the sAg fragment isolated 
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by gel electrophoresis and inserted into the vector pHBS16 (Valenzuela et al., Nature (1982) 298:347-350). 
This plasmid has the ADH1 promoter and the sAg gene in an Ea)RI fragment in a plasmid containing the 
2um origin, a Trgl gene and pBR322. The plasmid was digested with Eo)RI, treated with alkaline 
phosphatase to prevent recircularization, and the EcoRI fragment from pHBS5-3 inserted to provide 
s pHBSl6-3, where the sAg gene isolated as a Taql-Hpal fragment had been modified by Bal31 resection. 
The plasmid pHBSl6-3 was digested with Sghl and Xbal to provide a fragment which had the ADH 
promoter at the Sph terminus and the 5*-end of the sAg gene. 

pHBS56 was then digested with Sghl. pHBS56 was prepared from pCl/1 by digestion with Sghl, which 
deletes a portion of the plasmid spanning the 2um-pBR322 joint. The active portion of the ADH1 promoter 

ro region is contained within the Sphl-Hindlll fragment of approximately 300bp (Bennetzen et al., J. Biol. 
Chem. (1982) 257:301). The Sphl site in the ADH promoter begins at position -413 and the yeast terminator 
sequence is contained within a Hindlll-Sphl fragment of about 330bp. In each case the Sphl site is distal to 
the coding region. A 1500bp ADH1 promoter fragment terminating at position -9 (Hitzeman et al.. Nature - 
(1981) 293:717) and an approximately 450bp terminator unit from nucleotides 913 to 1368 in the ADH gene 

75 nucleotide sequence were joined at a Hindlll site between the fragments and cloned into the BamHI site of 
the vector YEp13 (Broach and Hicks, Gene (1979) 8:121) to provide pADH5. 

The HBsAg-DNA segment of pHBS5 was excised by Eco RI digestion, blunt-ended with the Klenow 
fragment and joined at both ends with Hindlll linkers, CAAGCTTG. After digestion with Hindlll, the HBsAg 
fragment was inserted into the Hindlll site of the plasmid pADHS which had been digested at the Hindlll site 

20 intermediate the ADH1 promoter and terminator sequence. A plasmid with the HBsAg gene in the" correct 
orientation as determined by restriction analysis was designated pHBS22. The cassette was included 
between two Sghl restriction sites. pHBS22 was digested with Sghl to obtain a fragment of about 1500bp 
and inserted into Sghl digested pC1/1 to provide pHBS56 which was cloned in E. coli HB101. 

pHBS56 was digested with Sghl and Xbal to provide a 1.1 kb fragment having the ADH terminator 

25 region and the 3 f -portion of the sAg gene with the Sphl site proximal to the terminator region. The 1.1 kb 
Sghl-Xbal fragment was joined to the Sphl-Xbal fragment from pHBSl6-3 ( which resulted in providing the 
complete sAg gene in the correct orientation between the ADH promoter and terminator. This Sphl-Sphl 
fragment was then ligated to Sghl digested pHBS56, replacing the cassette of pHBS56 to providelhe 
plasmid pHBS56/16-3 with the resected sAg coding region fragment. The cassette was then excised from 

30 pHBS56/16-3 by digestion with Sphl, followed by chewing back the overhangs with the Klenow fragment in 
the presence of dNTPs, then ligated with Bam HI linkers, followed by digestion with Bam HI to provide a 
1.6kb fragment which was isolated by gel electrophoresis. The fragment included the ADH promoter region, 
the sAg gene and ADH terminator region, as described above. This fragment was inserted into the Bam HI 
site of pBR322 to provide pPGT16-3 which was digested with BamHI and Xbal and the resulting 1.1 kb 

35 fragment gel isolated, where the Xbal-BamHI fragment had the 3' portion of the sAg gene and the ADH 
terminator region. 

pHBS6 was digested with Xbal and Ncol and the 94bp fragment gel isolated to provide the 5'-portion of 
the sAg gene. A synthetic adapter was prepared of the formula 



45 having Tagl and Ncol termini and providing the -25 to -1 nucleotides of the GAPDH (GAP49) promoter and 
the initiation codon of the sAg gene. This synthetic fragment, the Ncol-Xbal fragment, and the Xbal-BamHI 
fragment were ligated simultaneously, followed by digestion with Tagl and BamHI. The resulting fragment 
was then substituted into pBR322 linearized with Clal and BamHI, followed by treatment with alkaline 
phosphatase. The resulting plasmid, which contains the -1 to -25bp of the GAPDH promoter region, the sAg 

so gene, and the ADH terminator, where the Ncol restriction site is lost was called pHBS6LGAPsAgtADH. 

pGAP1, a plasmid prepared by insertion of a Hindlll fragment containing the GAPDH gene GAP49 
(Holland and Holland, J. Biol. Chem. (1979) 254:5466-5474) inserted in the Hindlll site of pBR322, was 
digested with Hinf l and a 500bp promoter containing fragment isolated. The fragment was resected with 
Ba|31 to remove about 50 or 90bp, followed by ligation with Hindlll linkers and digestion with Hindlll. 

55 pBR322 was digested with Hindlll, followed by treatment with alkaline phosphatase and the about 450 or 
410bp fragment inserted to provide pGAP128 and pGAP396, respectively. 

PGAP128 was digested with Hindlll, the fragment made blunt-ended with the Klenow fragment and 
dNTPs and the resulting 450bp fragment isolated by gel electrophoresis. This fragment was inserted into 



40 



CGA 2 TA 3 ( CA ) 3 TA 3 CA 3 CAA 

T 2 AT 3 * GT * 3 A T 3 GT 3 GTTGTAC 
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Sma l digested plots, which had been treated with alkaline phosphatase, to provide plasmid plot5pGAP128, 
which contained about -400 to +27bp of the GAPDH promoter and coding region. Plasmid plot5pGAP396 
was prepared from pGAP396 in an identical manner and thus differs from plasmid plots pGAP 128 in having 
about 15-30 fewer bp at each terminus of the GAPDH promoter region (about -385 to -3). 
5 Plasmids GAP1-GAP4 were then prepared in the following manner. Plasmid plot5pGAP128 was 
digested with Tagl and Bam HI to provide an about 390bp Taql- Bam HI fragment which included the -26 to 
about -400bp of the GAPDH promoter region and a portion of the Hindlll and plots polylinker. 
pHBS6LGAPsAgtADH plasmid was also digested with Tagl and Bam HI and a 1.1 kb Taql- Bam HI fragment 
containing the SMerminus of the GAPDH promoter region, the sAg gene and the ADH terminator region was 

io gel isolated and ligated to the other Taql- Bam HI fragment to provide a Bam HI- Bam HI fragment which 
included approximately 400bp of the GAPDH promoter region, the sAg gene in proper orientation for 
transcriptional regulation by the GAPDH promoter, followed by the ADH terminator region. This fragment 
was ligated into pBR322 which had been digested with Bam HI and treated with alkaline phosphatase to 
provide plasmid pPGT80. This Bam HI cassette could now be isolated and inserted into plasmid pC1/1, at 

75 the Bam HI site in the pBR322 portion of pC1/1, where in plasmid GAP1 the ADH terminator region is 
proximal to the amp r gene with the pBR322 portion divided into an approximately 4kb sequence including 
the amp r gene and a 375bp region separating the cassette from the 2um sequences. In GAP2, the 
promoter is adjacent to the long pBR322 sequence with transcription in the same direction as the amp r 
gene. The same cassette was inserted into BamHI-digested PC1/1GAL4/650 to obtain plasmids GAP3 and 

20 GAP4, where GAP3 has the GAPDH promoter distal from the GAL4 regulator region and the long pBR322 
sequence and GAP4 has the GAPDH promoter adjacent to the GAL4 regulator region, which is adjacent to 
the long pBR322 sequence. 

Plasmids GAPS and GAP6 were isolated as follows. Plasmid plot5pGAP396 was digested with Sail and 
Tag l and a fragment containing 9bp of the plots polylinker sequence and the GAPDH promoter sequence 

25 extending from about -385 to -26bp was isolated. An approximately 130bp Taql-Xbal fragment including -25 
to -1bp of the GAPDH promoter and +1 to +93bp of the sAg gene was obtained from 
pHBS6LGAPsAgtADH. A 1.1 kb Xbal-Sall fragment containing the 3 f -portion of the sAg gene and the ADH 
terminator as well as 6bp of plots polylinker sequence was obtained from plasmid plotSsAgtADH (described 
below— Pyruvate Kinase Promoter). These three fragments were ligated, digested with Sail and then cloned 

30 into Sail-digested pC1/1GAL4/370. GAPS has the GAPDH promoter region adjacent to the GAL4 regulator 
region, which is proximal to the short pBR322 sequence, and GAP6 has the GAPDH promoter region distal 
from the GAL4 regulator region and proximal to the long pBR322 sequence (see Fig. 1). 

Pyruvate kinase promoter . 

35 

Plasmid pHBS6Pyk containing the sAg gene under the transcriptional regulatory control of the Pyk 
promoter was obtained by cloning a 4.4kb insert of yeast genomic DNA in pBR322 containing the Pyk gene 
and 911 nucleotides of S'-untranslated region, and digestion of this plasmid pPyk9.1.1 with Xbal. After 
making the ends blunted-ended, the linear fragment was digested with Bam HI providing a 912bp Bam HI- 

40 blunt fragment containing the Pyk promoter and 8 bases from the Pyk coding region. This fragment was 
inserted into the plasmid pHBS6 t which had been digested with Ncol, blunt-ended and digested with 
BamHI. The plasmid pHBSBPyk was totally digested with EcoRI, to obtain a fragment including the sAg 
gene and a portion of the Pyk promoter region. The fragment was made blunt-ended with the Wenow 
fragment and dNTPs, followed by ligation to Bam HI linkers, digested with Xbal, which is internal to the sAg 

45 gene, the Xbal terminus made blunt-ended with the Klenow fragment and dNTPs, followed by digestion with 
BamHI, to provide a 580bp BamHI-blunt-ended (Xba l) fragment. The plasmid plots was digested with 
EcoRI, made blunt-ended, digested with Bam HI and treated with alkaline phosphatase and the two 
fragments joined to provide plasmids plotSPyksAgSI and plot5PyksAg.57. The two differ in that the Bam HI 
site of the latter was not regenerated during cloning, possibly as a consequence of minimal nuclease 

so contamination (digestion). 

plots was treated as previously described (EcoRI digestion, blunt-ended, Bam HI digestion and treat- 
ment with alkaline phosphatase) and joined to a 1.1 kb fragment obtained by digestion of pPGT16-3 with 
Xbal. followed by blunt ending, followed by digestion with Bam HI and gel isolation. This fragment was 
introduced into plots to provide the plasmid plotSsAgtADH. Again the Bam HI site in this plasmid was not 

55 regenerated, presumably due to digestion by contaminating nuclease. 

Plasmids Pyk1 and Pyk2 were prepared as follows. Plasmid plot5PyksAg51 was digested with Bam HI, 
then with Xba l, and an approximately 580bp fragment containing about 480bp of Pyk promoter and 93bp of 
the S'-end of the sAg gene was gel isolated. A 1.1 kb Xbal-Sall fragment containing the 3*-portion of the sAg 
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gene, the ADH terminator and about 6bp of the plots polylinker was isolated from plotSAgtADH. These two 
fragments were ligated, digested with Bam HI and Sail and then cloned into plasmid pC1/1, which had been 
cleaved with Bam HI and Sal) and treated with alkaline phosphatase, to yield plasmid Pyk1. Plasmid Pyk2 
was prepared similarly but the 580bp Sajl-Xbal, Pyk promoter/HBsAg gene S'-end fusion fragment was 
isolated from plot5PyksAg.57 and included about 6bp of plots polylinker sequence upstream from the 
promoter region. Also the 1.1 kb Xbal-BamHI fragment containing the 3*-part of the HBsAg gene and the 
ADH terminator was derived from plasmid pPGT16-3. 

Plasmids Pyk3-Pyk6 were prepared as follows. Plasmid plotSPyksAgSI was digested with BamHI, then 
with Xbal and the about 580bp fragment containing the Pyk promoter and the S'-part of the HBsAg gene 
isolated as above. The 1.1 kb Bam HI-Xbal fragment, containing the 3*-portion of the HBsAg gene and ADH 
terminator, was recovered from pPGT16-3, also as above, and the two fragments ligated, digested with 
BamHI and inserted with different orientations into the Bam HI site of pC1/lGAL4/650 (Pyk3, Pyk4). 
Plasmids PykS and Pyk6 were prepared similarly except that the Sall-Xbal fragment containing the Pyk 
promoter and S'-end of the sAg gene was isolated from plotSPyksAg.57 and the Xbal-Sall sAg gene 3*- 
portion/ADH terminator fusion fragment was derived from plotSsAgtADH and thus both fragments included 
approximately 6bp of plots polylinker sequence. The cassette so formed was then cloned into the Sail site 
of pC1/1GAL4/370 in opposite orientations. 

The six plasmids designated Pyk1-6 (see Fig. 2) are distinguished by Pyk1 having the promoter region 
proximal to the short pBR322 sequence; Pyk2 having the promoter region proximal to the long pBR322 
sequence; Pyk3 having the promoter region proximal to the short pBR322 sequence and distal from the 
GAL4 sequence; while Pyk4 has the promoter region proximal to the GAL4 region, which in turn is proximal 
to the long pBR322 sequence; PykS has the promoter region proximal to the GAL4 region which is proximal 
to the short pBR322 sequence; while Pyk6 has a promoter region distal from the GAL4 region and proximal 
to the long pBR322 sequence. 

These plasmids described above were transformed into S. carlsbergensis strain 2150-2-3 (available 
from Lee Hartwell, University of Washington) under conventional conditions (Hinnen et a|., Prog Natl. Acad. 
ScUJSA (1978) 75:1929-1933). Cultures of 50-150ml were grown to mid or late log phase in rich media 
(YEP) under neutral conditions (3% glycerol, 2% lactic acid), and then inducing conditions ( + 2% 
galactose), or repressing conditions (+2% glucose) for the final 1-2 generations. After lysis with glass 
beads and clarification of the supernatants by centrifugation, HBsAg expression was determined as 
described above. The results for the 12 plasmids are set forth in the following Table 1. 
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55 Construction of pPGAP . 

A yeast expression vector was prepared called pPGAP having a polyrestriction site linker between the 
GAPDH terminator and short promoter region. Plasmid plot5pGAP128 was digested with Bam HI and Tagl to 
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yield an approximately 390bp Bam HI-Taql fragment having the -400 to -26bp of the GAPDH promoter/The 
Bam HI-Taql fragment was ligated to a synthetic fragment having the following sequence: 

CGA 2 TA 3 < CA ) 3 TA 3 CA 3 CACCATG 3 A 2 T 2 CGT 2 AG 2 
T 2 AT 3 (GT)3AT 3 GT3GTGGTAC 3 T 2 A 2 GCA 2 TC 2 AGCT 

to provide a Bam HI-Sall fragment, which was digested with Bam HI and Sail and used to replace the BamHI- 
Sall fragment of Bam HI-Sall digested pBR322 treated with alkaline phosphatase. After ligation, the plasmid 
pGAPNRS was obtained which was digested with Bam HI and Sail to provide a 400bp Bam HI-Sall fragment 
which was gel isolated. This fragment was ligated to an about 900bp Sall- Bam HI fragment containing the 
GAPDH terminator region and a short segment of 3' coding region and the resulting 1 .4kb Bam HI- Bam HI 
fragment digested with Bam HI. The Sall -Bam HI GAPDH terminator fragment was obtained by Sail and 
BamHI digestion of pGAP2, a plasmid prepared by insertion of an about 3.3kb Bam HI fragment containing 
the GAPDH gene GAP49 (Holland and Holland, supra ) into the BamHI site of pBR322. Plasmids pGAP2 and 
pGAP1 were obtained as follows: A yeast gene library was prepared by inserting fragments obtained after 
partial digestion of total yeast DNA with restriction endonuclease Sau3A in lambda-phage Charon 28 
(Blattner et a}., Science (1977) 196:161-169). The phage library was screened with DNA complementary to 
the yeast GAPDH mRNA and the yeast GAPDH gene from one of these clones was subcloned as either an 
about 3.3kb BamHI fragment in the Bam HI site of pBR322 (pGAP-2) or as an about 2.1 kb Hindlll fragment 
in the Hindlll site of pBR322 (pGAP-1 ), 

pBR322 was digested with EcoRI and Sail, the termini blunt-ended and ligated to Bam HI linkers, 
followed by BamHI digestion and the Bam HI- Bam HI 3.8kb fragment gel isolated, recircularized by self- 
ligation, cloned and designated pBRAR1-Sal. The 1.4kb BamHI-BamHI fragment was inserted into the 
Bam HI-digested, alkaline phosphatase treated pBRAR1-Sal vector to provide the plasmid pPGAP of about 
5.3kb with the orientation in the opposite direction of the amp r . 

The plasmid phSOD was prepared as follows: 

Molecular cloning of hSOD cDNA. 

Total RNA was prepared from an adult human liver by the guanidinium thiocyanate/lithium chloride 
method (Cathala et al., DNA (1983) 2:329-335). polyA RNA was used to synthesize double-stranded cDNA 
(Maniatis et al., Molecular Cloning , 213-242, Cold Spring Harbor, 1982) and this was passed over a 
Sepharose® CL4B column to enrich for cDNAs of greater than 350bp (Fiddes and Goodman, Nature (1979) 
281:351-356). The cDNA was inserted at the Pstl site of plot4, a pBR322 derivative having the following 
sequence replacing the Pstl-EcoRI site. 

Pst l Hinfl Alul 
1 GGTGAATCCGTMTCATGGTCATAGCTGTTTCCTGTGTGAAATTGTTATCCGCTC 
ACGTCCACTTAGGCATTAGTACCACTATCGACAAAGGACACACTTTAACAATAGGCGAG 
Hph I 

Hindi I Alul 

60 ACAATTCCACACATTATACGAGCCGATGATTAATTGTCMCAGCTCATTTCAGMTATTT 
TGTTAAGGTGTGTMTATGCTCGGCTACTAATTAACAGTTGTCGAGTAAAGTCTTATAAA 

EcoRI 

120 GCCAGAACCGTTATGATGCGG 

CGGTCTTGGCAATACTACGCCTTAA 

The cDNA insertion employed the oligo-dG:dC tailing method (Maniatis et al., supra ). E. c£li strain D1210 
was transformed with this mixture and transformants selected on L-agar containing I0ug/ml tetracycline 
(Kushner, S.R. (1978) In: Genetic Engineering , eds. Boyer, H.B. and Nicosia, S., (Elsevier/North Holland, 
Amsterdam) p. 17). Plasmid DNA constituting a liver cDNA library was prepared (Maniatis et al., Molecular 
Cloning, pp. 86-94, Cold Spring Harbor 1982) directly from approximately 62,000 recombinant colonies 
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plated at a density of approximately 3,000 colonies per 9cm diameter Petri dish. 
Isolation of r-hSOD clones . 

5 Strain D1210 was retransformed with the liver cDNA library and about 40,000 clones were grown on 
nine 14cm diameter Petri dishes. After transfer of the colonies to nitrocellulose paper and chloramphenicol 
amplification of plasmid DNA, the cells were lysed and the filters prepared for hybridization (Ish-Horowicz 
and Burke, Nucleic Acids Research (1981) 9:2989-2998). Oligonucleotide probes were employed for 
screening by hybridization, with the probes consisting of enzymatically-radiolabeled, chemically-synthesized 

70 DNA molecules complementary to the mRNA encoding amino acid residues 19 to 24 of the protein 
(Jabusch et al., supra .; Barra et al., supra .); the mixture had the following sequences: 



3 1 TTA AAA CTT GTT TTT CT 5 1 
G G C C C 



where all of the indicated possibilities for encoding the peptide sequence were prepared (32-fold degen- 
erate). 

20 The probes were labeled with ^P to a specific activity of l-SxKFcpm/ug and Millipore (0.45um) filtered 
before use. Rlters were prehybridized for 6hrs at 30 *C in 4x SSC, 2x.Denhardts's solution, 40mM sodium 
phosphate, pH 7.5, 300ug/ml sonicated salmon testes DNA. Hybridization was for 20hrs at 30 • C in the 
same solution containing 2x1(Fcpm/ml hSOD DNA probe (residues 19-24). Filters were washed in 4x SSC, 
once for 15min at r.t. and twice for 15min at 30 *C, blotted dry and autoradiographed with an intensifying 

25 screen for 24hrs at -70 • C. 

Areas on the master plates that corresponded to duplicate positive signals were picked into L-broth and 
plasmid DNA prepared by the miniscreen procedure (Maniatis et ah, Molecular Cloning , 178, 368-369, Cold 
Spring Harbor 1982). This DNA was cut with Pstl and subjected to Southern blot analysis (Southern, J. Mol. 
Biol . (1975) 98:503-517) hybridizing initially with the previous labeled probes (amino acid residues 19-24) 

30 and then with additional radiolabeled probes derived from amino acid residues 109-114 and having the 
following sequences (all possible variations, 72-fold degenerate) present as a mixture: 



35 3' CTA GTA ACA TAA TAA CC 5 1 

G G G G G 
T T 

One plasmid pool (pSOD1) contained a cDNA insert of 520bp that hybridized with both probes and after 
40 colony purification, plasmid DNA was prepared from this clone and sequenced by the method of Maxam 
and Gilbert (Proc. Natl. Acad. Sci. USA (1977) 74:560-564). The hSOD cDNA clone pSOD1 constitutes the 
coding region for amino acids 10-153 of hSOD, a single translational stop codon and a 3' untranslated 
region. Therefore, in the expression vector construct, the base sequence of the region encoding amino 
acids 1-9 is derived from the published amino acid sequence of hSOD (Jabusch et al., supra ; Barra et al., 
45 supra ) and synthesized chemically as a part of the variable linker segment (see discussion relating to"Rg. 
3). 

Construction of plots derivatives containing r-hSOD . 

so The synthetic DNA molecules F(26), C(16), B(31), D(11), E(13) and 4(24) shown in Fig. 3, were 
synthesized by the phosphoramidite method. 

The single strand 4(24) was prepared by using all four bases, at each site where X is indicated. 
Furthermore, silica was withdrawn from the synthesis of the 24mer, such that single-stranded 21mers, 
22mers, and 23mers are obtained in addition to the 24mers. After removal from the silica support, the four 
55 mixtures are combined in appropriate proportions to provide for equimolar amounts of each of the possible 
single strands. This mixture was treated as a single product in the subsequent steps. 

Molecules F(26), C(16), B(31) and D(11) were mixed together in equimolar amounts and 10ug 
phosphorylated using T4 polynucleotide kinase. After phenol-ether extraction, the additional non- 
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phosphorylated synthetic DNA molecules 4(24) and E(13) were added, such that all fragments were 
equimolar. The equimolar mixture contained 13ug of DNA in 133ul of 0.3x kinase buffer. 

After annealing by cooling at a uniform rate from 70 *C to 20 *C over 60min f the single strands were 
ligated together with T4 iigase in 200ul ligation mix at 14- C for 4hrs, phenol-chloroform extracted, ethanol 
precipitated and the S'-ends of 4(24) and E(13) phosphorylated using T4 polynucleotide kinase (Maniatis et 
a}., supra ). Preparative polyacrylamide gel electrophoresis was used to isolate the completely ligated 53bp 
material having 5*- and 3'-overhangs. 

The above purified fragment mixture was then ligated to the 460bp Tagl-Pstl segment of the hSOD 
cDNA as shown in Fig. 3. This segment was itself constructed by isolating the 454bp Taql-Alul hSOD 
fragment, making it flush-ended using Klenow and inserting it into plots between its EcoRI and Sail sites 
which had been similarly made flush-ended. After preparation of plasmid DNA from this recombinant, the 
460bp Tagl-Pstl hSOD fragment was isolated by preparative polyacrylamide gel electrophoresis. After 
extraction and precipitation, the 515bp fragment resulting from the joining of the synthetic fragment to the 
460bp Tagl-Pstl hSOD fragment was blunt-ended (525-528bp) and then digested with Sail and the resulting 
519-522bp hSOD fragment isolated by polyacrylamide gel electrophoresis. This fragment was then inserted 
into plots which had been digested with Pyull and Sail and then treated with alkaline phosphatase. The 
resulting plasmids were used to transform strain D1210. Recombinants obtained after transformation of 
strain D1210 were selected on L-agar containing 100ug/ml ampicillin to give a set of clones, which were 
screened for an Ncol site. One was selected and designated phSOD. 

Construction of a yeast vector for SOD expression . 

The plasmid phSOD was digested with Ncol and Sail and a SSObp fragment obtained, which included 1 
nucleotide untranslated at the S'-terminus and the entire coding region for hSOD. pPGAP was digested with 
Ncol and Sail followed by treatment with alkaline phosphatase and the Sall-Ncol fragment substituted for 
the Ncol-Sall fragment in pPGAP to provide pPGAPSOD. BamHI digestion of pPGAPSOD resulted in a 2kb 
fragment which was gel isolated and inserted into the Bam HI site of pCl/1 and pC1/1 GAL4/370. These 
plasmids were transformed into yeast strain 2150-2-3 as described previously, with the results of expression 
set forth in the following Table 2. 



Table 2 



Expression of Human SOD in Yeast Strain 2150 


Plasmid 


Carbon Source 


SOD 2 ug/mg protein 


pC1/1 


g. l 1 


0 


PC1/1GAPSOD 


g.L 


148 


pCl/1 GALGAPSOD 


g.L 


0.4 




gal 


68 



1 All cultures grown in Minus Leucine media with 2% lactic acid, 3% glycerol 
with or without 2% galactose to late log or early stationary phase. 

2 Determined by RIA. 



hSOD levels were measured using a standard radioimmunoassay with iodinated authentic hSOD as 
standard. Constitutive synthesis from the GAP promoter leads to very high levels of hSOD production, of 
the order of 10-30% of the total cell protein. The induction with galactose works almost as well, yielding 
about 7% of the cell protein as hSOD. 

Construction of pYASH 

This yeast expression plasmid contains the hSOD gene fused to the amino terminus of the human 
proinsulin gene, with a methionine codon at the junction between both genes. The fusion gene is under 
control of the hybrid inducible ADH2-GAP (yeast alcohol dehydrogenase 2) promoter and the GAP 
terminator. An about 3kbp Bam HI expression cassette was constructed. 

The ADH2 portion of the promoter was constructed by cutting a plasmid containing the wild type ADH2 
gene (plasmid pADR2, see Beier and Young, Nature (1982) 300:724-728), a pBR322 derivative, which 
contains the yeast centromere gene from chromosome 3 (CEN3) (U.S. Patent No. 4,464,472), the Trg1 
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gene and the yeast ADHII gene with the restriction enzyme EcoR5, which cuts at a position +66 relative to 
the ATG start codon, as well as in two other sites in pADR2, outside of the ADH2 region in the CEN 3 and 
Trp 1 gene. The resulting mixture of a vector fragment and two smaller fragments was resected with Bal31 
exonuclease to remove about 300bp. Synthetic Xhol linkers were ligated onto the Baj31 treated DNA. The 
5 resulting DNA linker vector fragment was separated from the linkers by column chromatography, cut with 
the restriction enzyme Xhol, religated and used to transform E. coji to ampicillin resistance. The positions of 
the Xhol linker additions were determined by DNA sequencing. One plasmid which contained an Xhol linker 
located within the 5' non-transcribed region of the ADH2 gene (position -232 from ATG) was cut with the 
restriction enzyme Xhol, treated with nuclease SI, and subsequently treated with the restriction enzyme 

70 EcoRI to create a linear vector molecule having one blunt end at the site of the Xhol linker and an EcoRI 
end at the pBR322 proximal terminus. 

The GAP portion of the promoter was constructed by cutting plasmid pPGAP (supra) with the enzymes 
BamHI and EcoRI, followed by the isolation of the 0.4Kbp DNA fragment. The purified fragment was cut 
with the enzyme AM to create a blunt end near the Bam HI site. 

J5 Plasmid pJS014 was constructed by the ligation of the AJul-EcoRI GAP promoter fragment to the ADH2 
fragment present on the linear vector described above. 

Plasmid pJSKM was digested with Bam HI (which cuts upstream of the ADH2 region) and with NojI 
(which cuts downstream of the GAP region). The about 1.3Kbp fragment containing the ADH2-GAP 
promoter was gel purified and ligated to an about 1.7Kbp fragment containing the hSOD-proinsulin fusion 

20 DNA sequences and GAP terminator present in pYSM. This 3Kbp expression cassette was cloned into 
Bam HI digested and phosphatase treated pCl/1 to yield pYASH. 

Plasmid pYSM was constructed as follows. Three fragments were employed which involve a 454bp 
Ncol-Sau3A isolated from phSOD, where the fragment includes the entire coding sequence for human 
superoxide dismutase (hSOD) with the exception of the last three 3*- codons; a 51 bp Sau3A-Hindlll 

25 synthetic adapter, which codes for the last three codons of hSOD, methionine, and the first 14 codons of 
proinsulin; and a 231 bp Hindlll-Sall fragment, isolated from pINSS (a pBR322-derived vector which contains 
a proinsulin coding sequence chemically synthesized according to the amino acid sequence reported by 
Bell et a|. f Nature (1979) 282:525-527), which encodes proinsulin excepting the first 14 amino acids. These 
fragments were ligated together and introduced into the plasmid pPGAP, which had been previously 

30 digested with Nco l and Sajl and alkaline phosphatase treated. The resulting plasmid pSI1 was digested with 
BamHI to provide an expression cassette which was cloned into plasmid pC1/1 to yield pYSH. 

Cloning of alpha-1 -antitrypsin . 

35 A cDNA library was made from 10ug of polyA + RNA isolated from a part of a human liver. This library 
was prepared by oligo-dT priming of the first cDNA strand and self-priming of the second cDNA strand. The 
ds cDNA was size fractionated on a Sepharose® CL4B column and those molecules greater than 300bp 
isolated. This fraction was treated with nuclease S1 and tailed with dCTP, using terminal transferase. The 
tailed cDNA was annealed to pBR322 which had been digested with Pstl and tailed with dGTP. Transforma- 

40 tion of E. cx>li HB101 yielded 60,000 colonies, where greater than 90% of the clones were recombinant. 

Two synthetic oligonucleotide probes were used to isolate the alpha-1 -antitrypsin (cn-AT) cDNA, the 
first probe corresponding to amino acid residues 344-350 near the C-terminus of the protein was used to 
probe 5,000 colonies and the second probe, corresponding to amino acid residues -23 to -17 (+ 1 being the 
first nucleotide of the first codon of the mature ^ -AT) of the signal peptide, was used to probe 25,000 

45 colonies. The probe sequences were taken from the partial nucleotide sequence described by Kurachi et 
§!•» Proc. Natl. Acad. Sci. USA (1981) 78:6826; Leicht et a}., Nature (1982) 297:655). Approximately 3% of 
the colonies hybridized to the C-terminal probe and four hybridized to the N-terminal probe. The four N- 
terminal clones and 12 C-terminal clones were isolated and subjected to restriction analysis. From these, 
three overlapping clones which cover the entire cDNA were subjected to further study and were used to 

so construct the full length cDNA clone. 
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The entire sequence of a composite full length cDNA derived from the three plasmids is as follows: 

'Met Pro Ser Ser 
GGGGGGGCGGAGGGTAATCGACA ATG CCC TCT TCT 

-JO -10 -1 

Val Ser Trp Gly He Leu Leu Leu Ala Gly Leu Cys Cys Leu Val Pro Val Ser Leu Ala 
CTC TCG TCG CGC ATC CTC CTC CTC CCA CCC CTC TCC TCC .CTC CTC CCT CTC TCC CTC OCT 

1 Clu A.ip Pro Gin Gly Asp Ala Ala Gin Lys Tiu* Asp Thr Scr His His Asp Gin Asp His 
1 GAG GAT CCC CAG CGA CAT CCT CCC CAG AAG ACA GAT ACA TCC CAC CAT CAT CAG GAT CAC 



BamHl 



**. Pro Thr Phe Asn Lys He Thr Pro Asn Leu Ala Glu Phe Ala Phe Ser Leu Tyr Arg Gin 
61 CCA ACC TTC AAC AAG ATC ACC CCC AAC CTG CCT GAG TTC CCC TTC AGC CTA TAG CGC CAG 

41 Leu Ala His Gin Ser Asn Ser Thr Asn He Phe Phe Ser Pro Val Ser He Ala Thr Ala 
121 CTG GCA CAC CAG TCC AAC AGC ACC AAT ATC TTC TTC TCC CCA GTG AGC ATC GCT ACA GCC 

61 Phe Ala Met Leu Ser Leu Gly Thr Lys Ala Asp Thr His Asp Glu lie Leu Glu Gly Leu 
181 TTT GCA ATG CTC TCC CTG GGC ACC AAG GCT GAC ACT CAC CAT GAA ATC CTG CAG GCC CTG 

81 Asn Phe Asn Leu Thr Glu He Pro Glu Ala Gin He His Glu Gly Phe Gin Glu Leu Leu 
2*1 AAT TTC AAC CTC ACC GAC ATT CCG GAG GCT CAG ATC CAT GAA GCC TTC CAG GAA CTC CTC 

Arg(a.c) Asp Gly(c) 

101 His Thr Leu Asn Gin Pro Asp Ser Gin Leu Gin Leu Thr Thr Gly Asn Gly Leu Phe Leu 
301 CAT ACC CTC AAC CAG CCA GAC AGC CAG CTC CAG CTC ACC ACC GGC AAT CGC CTC TTC CTC 

121 Ser Clu Cly Leu Lys Leu Val Asp Lys Phe Leu Glu Asp Val Lys Lys Leu Tyr His Ser 
361 ACC CAG CGC CTG AAG CTA GTG GAT AAG TTT TTG GAG GAT GTT AAA AAG TTG TAC CAC TCA 

1*1 Glu Ala Phe Thr Val Asn Phe Gly Asp Thr Glu Glu Ala Lys Lys Gin He Asn Asp Tyr 
*21 GAA CCC TTC ACT CTC AAC TTC CGC CAC ACC GAA GAG GCC AAG AAA. CAG ATC AAC GAT TAC 

161 Val Glu Lys Gly Thr Gin Gly Lys lie Val Asp Leu Val Lys Glu Leu Asp Arg Asp Thr 
*Sl GTG CAG AAG GGT ACT CAA GGG AAA ATT GTG GAT TTG CTC AAG CAG CTT GAC ACA CAC ACA 

. Ifll Val Phe Ala Leu Val Asn Tyr He Phe Phe Lys Cly Lys Trp Glu Arg Pro Phe Glu Val 
5*1 GTT TTT GCT CTG GTG AAT TAC ATC TTC TTT AAA CGC AAA TGG GAG ACA CCC TTT GAA GTC 

Ala(b) 

201 Lys Aap Thr Clu Glu Glu Asp Phe His Val Asp Gin Val Thr Thr Val Lys Val Pro Met 
601 AAC CAC ACC GAG CAA GAG GAC TTC CAC GTG GAC CAG GTG ACC ACC CTG AAC GTG CCT ATG 

- BstEII 

221 Met Lys Arg Leu Gly Met Phe Asn lie Gin His Cys Lys Lys Leu Ser Ser Trp Val Leu 
661 ATC AAC CCT TTA GGC ATG TTT AAC ATC CAG CAC TCT AAC AAG CTC TCC AGC TGG GTG CTG 

Asn(c) 

2*1 Leu Met Lys Tyr Leu Gly Asn Ala Thr Ala He Phe Phe Leu Pro Asp Glu Gly Lys Leu 
721 CTC ATG AAA TAC CTG GCC AAT GCC ACC CCC ATC TTC TTC CTG CCT GAT GAG GGG AAA CTA 

261 Gin His Leu Glu Asn Glu Leu Thr His Asp He He Thr Lys Phe Leu Glu Asn Glu Asd 
731 CAC CAC CTG CAA AAT GAA CTC ACC CAC GAT ATC ATC ACC AAG TTC CTG GAA AAT GAA GAC 

EcoHV 

281 Arg Arg Ser Ala Ser Leu His Leu Pro Lys Leu Ser He Thr Gly Thr Tyr Asp Leu Lys 
8*1 AGA ACC TCT CCC AGC TTA CAT TTA CCC AAA CTG TCC ATT ACT CGA ACC TAT GAT CTG AAG 

Val(a.c) 

301 Ser He Leu Gly Gin Leu Gly He Thr Lys Val Phe Ser Asn Gly Ala Asp Leu Ser Gly 
901 AGC ATC CTG GCt CAA CTG GCC ATC ACT AAG GTC TTC ACC AAT GCC GCT GAC CTC TCC GGft 

321 Val Thr Glu Glu Ala Pro Leu Lys Leu Ser Lys Ala Val His Lys Ala Val Leu Thr He 
901 CTC ACA CAC CAG CCA CCC CTC AAG CTC TCC AAC CCC CTG CAT AAG GCT CTC CTG ACC ATC 

3*» Asp Clu Lys Gly Thr Glu Ala Ala Gly Ala Met Phe Leu Glu Ala He ProlMet Ser I lie 
1021 CAC CAC AAA GCC ACT CAA GCT CCT GGC GCC ATC TTT TTA GAG GCC ATA CCc IaTG TCTl ATC 

36l Pro Pro Glu Val Lys Phe Asn Lys Pro Phe Val Phe Leu Met lie Clu Gin Asn Thr Lvs 
1081 CCC CCC CAG GTC AAG TTC AAC AAA CCC TTT GTC TTC TTA ATC ATT CAA CAA AAT ACC AAG 
— Aval 

381 Ser Pro Leu Phe Met Gly Lys Val Val Asn Pro Thr Gin Lys OC 
11*1 TCT CCC CTC TTC ATG CCA AAA CTC CTC AAT CCC ACC CAA AAA TAA CTCCCTCTCGCTCCTCAAC 

- Hinfl 

AAT CCC ACC CAA AAA TAG 

GCC TCC CTT TTT ATC A GCT 

Sail 

1^31 CCCTCCCCTCCATCCCTCCCCCCCTCCCTCCATGACATTAAACAAGGCTTCAGCTGCAAAAAAAAAAAAAAAAAAAAAA 



17 



EP 0 164 556 B1 



LEGEND 



Nucleotide and predicted amino acid sequences of 01 -AT cDNA. The reactive center met-ser at 
positions 358-359 is boxed. Subscripts to amino acids in parentheses identify differences between the 
5 subject protein sequence and those derived from (a) protein sequencing (Carrel! et al. t 1982), (b) the cDNA 
of Woo et a}., [see Carrell et al., 1982]), and (c) the cDNA of Bollen et a}., 1983. The synthetic DNA 
molecules used in the construction of the Bam HI to Sail fragment encoding the mature protein are shown as 
are the cDNA restriction sites used in this construction. 

The above sequence was determined using the dideoxy sequencing method of Sanger et aj., Proc. 
10 Natl. Acad. Sci. USA (1977) 74:5463, in the M13 vectors of Messing et al., Nucleic Acids Res. (1981) 9:309. 
The differences at the nucleotide and amino acid level from the published cDNA sequences are shown. 

Construction of the full length clone for expression of yeast began with three fragments isolated from 
cDNA clones: 1) a 630bp BamHI-BstEII fragment; 2) a 450bp BstEII-Aval fragment; and 3) an 85bp Ava l- 
Hinf l fragment. A synthetic adapter was employed having the following sequence: 

75 

A 2 TC 3 AC 3 A 5 TAG 

G 3 TG 3 T 5 ATCAGCT 

20 



Approximately two pmoles of fragments 1 and 2 were ligated together and after removal of the ligase, 
digested with Bam HI and Aval. Fragment 3 and the synthetic adapter were ligated and digested with Aval 
and Sail and the two resulting fragment mixtures were ligated followed by digestion with Bam HI and Sail. 
25 Fragments migrating upon electrophoresis in the region of about 1 000-1 400bp were isolated and cloned by 
substitution into Bam HI and Sail digested and alkaline phosphatase treated pBR322. The resulting plasmid 
is referred to as pATi. 

Plasmid pPGAP was digested with Ncol, followed by blunt-ending, followed by Sail digestion and 
treatment with alkaline phosphatase. The Ncol-Sall fragment was substituted with an approximately 1250bp 

30 blunt-ended (Bam HI)-Sall fragment obtained from plasmid pATi, by Bam HI digestion, blunt ending, and Sail 
digestion. This was inserted into the pPGAP vector to produce the plasmid pGAPATi, a 6.6kb plasmid, 
which was digested with Nco l and Bam HI and a 2.3kb Ncol- Bam HI fragment obtained having the cm -AT 
gene and the GAPDH terminator and approximately 400bp Bam HI-Ncol fragment obtained having the 
GAPDH promoter. These fragments were ligated together and inserted into the Bam HI site of pC1/1. The 

35 plasmids pCl/1GAPATi8 and pC1/1GAPATi9 were obtained with the orientation of expression clockwise in 
the former and counterclockwise in the latter, with amp f being in the counterclockwise direction. These 
plasmids were transformed in S. cerevisiae AB103 (AT.C.C. No. 20658, deposited January 5, 1983) by 
standard methods, selecting for leucine prototrophy and grown as described above. Yeast extracts were 
prepared by lysis with glass beads and the on -AT activity determined by inhibition of human leukocyte 

40 elastase. 

Assays contained in 1ml:0.1-0.2 human leukocyte elastase (HLE); 0.1mM MeO-Suc-Ala-Ala-Pro-Val-p- 
nitro-anilide (Beatty et al., J. Biol. Chem. (1980) 255:3931); 50mM Tris, pH 8, 0.5M NaCI, and the indicated 
amounts of yeast extract or human 01 -AT. Assays were initiated by the addition of elastase, incubated at 
28 *C for 15min, terminated by the addition of 100ul of 8N acetic acid and the absorbance at 410nm 
45 determined. Typical results are shown in the following Table 3. 
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The above data demonstrate that piasmids having the orientation of the expression cassette in the 
55 counterclockwise orientation, the promoter proximal to the long sequence of pBR322, make 10-20 times 
more ai-AT than the same cassette in the other orientation. 
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Yeast strain AB110 . 

Yeast strain 2150-2-3 was crossed with a yeast strain AB103 transformant containing pC1/1GAPATi9. 
The diploids were sporulated and the tetrads dissected. Strains were maintained on leucine selective plates 
5 in order to ensure maintenance of the plasmid, since the parents are auxotrophs. A series of colonies were 
screened for their genotype with respect to a number of markers. The most vigorous strains were selected 
and cultures grown on leucine selective media. The best strain was designated AB110 (pC1/1GAPATi9), 
gave 6-7.5% of the total cell protein as on -AT as shown in the above Table 3. The strain AB110 has the 
following genotype: Mat*, ura3-52 t leu2-04 or both leu2-3 and Ieu2-112, pep4-3. his4-580 (cir • ). 

w 

Phosphate induction . 

Plasmid pPGT80 was digested with Bam HI, the ends blunt-ended, followed by digestion with Xbal and 
the 500bp fragment containing the GAPDH promoter and 5'-end of the sAg gene isolated. 

15 The PH05 gene was isolated from a yeast genomic library employing an oligonucleotide probe 5 f - 
GGCACTCACACGTGGGACTAG-3' derived from the published partial sequence (Meyhack et al„ The 
EMBO Journal (1982) 1^:675-680). A subfragment of this clone containing 550bp of the 5'-untranslated 
region and approximately 80bp of coding sequence was subcloned as a Bam HI-Sall substitution in pBR322 
to provide pPHOS. This fragment has the sequence 5'-ATGTTTAAA-3\ encoding the first three amino acids, 

20 the second and third codons specifying an Ahalll site. The plasmid pHBS6 was digested with Ncol, blunt- 
ended, followed by digestion with Bam HI and treatment with alkaline phosphatase. The PH05~promoter 
region was obtained by digesting the pPH05 plasmid with Ahalll, resecting the resulting fragment with 
Bal31 for a short time, followed by digestion with Bam HI and isolation of a 500-550bp Bam HI blunt-ended 
fragment. This fragment was employed for substitution of the Ncol-BamHI fragment from pHBS6 and was 

25 screened for regeneration of the Ncol restriction site to provide plasmid pHBS6PH05/1. 

Plasmid pHBS6PH05/1 was digested with BstEII which cleaves at position -175 in the PH05 promoter. 
This molecule was blunt-ended, digested with Sail and the 650bp fragment having the S'-portion of the 
promoter domain, containing 275bp of pBR322 and 375bp of the PH05 promoter region isolated. This 
fragment was ligated with the blunt-ended (Bam HI)-Xbal fragment obtained from digestion of pPGT80 with 

30 BamHI, blunt ending, followed by Xba l digestion. After digesting the ligated fragment with Sail and Xbal, the 
resulting fragment was then substituted into pPGT16-3 which had been digested with SajT and Xbal and 
treated with alkaline phosphatase. The resulting plasmid pPHO5PGT80 had a cassette comprising the 
PH05 regulatory region, the GAPDH promoter, the sAg gene and the ADH terminator. This cassette was 
excised from the plasmid by Bam HI digestion, whereby a 1.8kb Bam HI- Bam HI fragment was gel isolated 

35 and ligated into the Bam HI site of Bam HI digested and alkaline phosphatase treated pCl/1 to provide 
plasmids PH05GAP1 and PH05GAP2 where the PH05 was distal and proximal to the long pBR322 
sequence, respectively. 

The two plasmids were transformed into yeast strain 2150-2-3 as described above and grown in rich 
media as described above for 8 to 10 generations in either high (7mM) or low (0.2mM) phosphate. Samples 
40 were harvested in late log phase and HBsAg determined as described previously. The results are shown 
below in Table 4. 



Table 4 



Regulation of HBsAg Production in Yeast using a Hybrid PH05/GAPDH Promoter, 


Construction 


High Phosphate (7mM) 


Low Phosphate (0.2m M) 


Induction low / high 




(sAg ug/mg protein) 




PH05GAP-1 


0.08 


0.95 


12.0 


PH05GAP-2 


0.27 


0.40 


1.5 



From the above results, it is evident that effective regulation with phosphate is obtained, with one 
orientation being superior to the other. 

It is evident from the above results that highly efficient expression can be obtained, either constitutive or 
regulated, by providing for truncated promoter regions of yeast glycolytic enzyme gene promoters, 
employing the 3' domain proximal to the coding region of the gene in conjunction with a S'-portion or 
second domain of the promoter region of a yeast gene subject to inducible regulation by a nutrient, e.g., 
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carbon source or phosphate, temperature, or other externally controllable source or condition. Alternatively, 
the second domain may be replaced by prokaryotic sequences of at least about 1kb or greater, which 
provide for constitutive enhancement in the absence of the second domain of the promoter region. Thus, a 
wide variety of genes exogenous to yeast may be expressed in high yield in high percentages of the total 
5 protein of the yeast host. 

Although the foregoing invention has been described in some detail by way of illustration and example 
for purposes of clarity of understanding, it will be obvious that certain changes and modifications may be 
practiced within the scope of the appended claims. 

70 Claims 

1. A method for obtaining expression of a foreign structural gene in a yeast host employing hybrid 
constructions involving transcriptional and translational regulatory signals recognized by said host 
joined to a foreign gene for transcriptional and translational regulation of expression in said host, 

75 wherein said construction is introduced into said host and said foreign gene is expressed to produce 
the encoded polypeptide, comprising: 

employing for transcriptional regulation, a DNA sequence having first and second domains, 
said first domain being proximal to said structural gene, being of less than about 600bp and having a 
sequence which includes the RNA polymerase binding site and transcription initiation site of a yeast 
20 glyceraldehyde-3-phosphate dehydrogenase gene; 

said second domain being proximal to said first domain and distal to said structural gene, providing for 
enhanced efficiency of transcription and being the regulatory region of a regulated yeast gene other 
than the natural gene of said first domain; and 
said regulatory region being the PH05 regulatory region. 

25 

2. A method according to Claim 1, wherein said polypeptide is Hepatitis B surface antigen, superoxide 
dismutase, or 01 -antitrypsin. 

3. A method according to claim 1, wherein said regulatory region is the ADHII regulatory region. 

30 

4. A method for obtaining expression of a foreign polypeptide in a yeast host employing a hybrid 
construction involving transcriptional and translational regulatory signals recognized by said host joined 
to a foreign gene for transcriptional and translational regulation of expression in said host, wherein said 
construction is introduced into said host and said foreign gene is expressed to produce the encoded 

35 polypeptide, comprising employing for transcriptional regulation, a DNA sequence having first and 
second domains, 

said first domain being proximal to said structural gene, being of about 200-500bp, being derived from 
the S'-untranslated region of the yeast glyceraldehyde-3-phosphate dehydrogenase or pyruvate kinase 
gene and extending downstream to at least nucleotide -10; and 
40 said second domain being distal to said first domain and proximal to said structural gene, being a 
prokaryotic sequence of at least about 1000bp, wherein said yeast cells containing said construct are 
grown to high density; and 
isolating said polypeptide. 

45 5- A method for preparing a polypeptide in a yeast host, where the polypeptide is heterologous to the 
yeast and may be produced in low percentage amounts of total protein, comprising: 
joining an open reading frame DNA sequence coding for said polypeptide with a second open reading 
frame DNA sequence coding for heterologous human superoxide dismutase to form a fusion polypep- 
tide; 

so introducing the sequence coding for said fusion polypeptide under conditions for expression in said 
yeast which includes an inducible transcriptional initiation regulatory region that consists essentially of, 
reading in a 5* to 3* direction, a yeast glycolytic enzyme promoter region and adjacent alcohol 
dehydrogenase-2 control region, whereby said fusion polypeptide is expressed; 
isolating said fusion polypeptide to provide the polypeptide in high yield. 

55 
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Patentansprtlche 

1. Verfahren zum Erhalt eines fremden strukturellen Gens in einem Hefewirt unter Verwendung von 
hybriden Konstrukten, die transkriptionelle und translatorische, von dem Wirt erkannte Regulationssi- 

5 gnale, welche mit einem Fremdgen zur transkriptionellen und translatorischen Regulation der Expres- 
sion in dem Wirt verbunden sind, einschlieBen, wobei das Konstrukt in den Wirt eingefQhrt und das 
Fremdgen zur Bildung des kodierten Polypeptids exprimiert wird, 
gekennzeichnet durch 

die Verwendung einer DNA-Sequenz mit ersten und zweiten DomSnen zur transkriptionellen Regulation, 
to wobei die erste Domane sich proximal zu dem Strukturgen befindet, weniger als etwa 600 bp hat und 
eine Sequenz aufweist, die die Bindungsstelle der RNA-Polymerase und die 
Initiationsstelle 

der Transkription eines Hefe-Glyceraldehyd-3-phosphat-dehydrogenasegens einschlieBt; und wobei sich 
die zweite Domane proximal zu der ersten DomSne sowie zu dem Strukturgen befindet, die verbesserte 
75 Effizienz der Transkription bewirkt und die Regulationsregion eines regulierten Hefegens, das ein 
anderes als das natOrliche Gen der ersten Domane ist und die Regulationsregion die PH05-Regula- 
tionsregion ist. 

2. Verfahren nach Anspruch 1, bei dem das Polypeptid das Hepatitis-B-Oberflachen-Antigen, Superoxid- 
20 dismutase oder alphai -Antitrypsin ist. 

3. Verfahren nach Anspruch 1, bei dem die Regulationsregion die ADHII-Regulationsregion ist. 

4. Verfahren zum Erhalt der Expression eines Fremdpoiypeptids in einem Hefewirt unter Verwendung 
25 eines Hybridkonstrukts, das transkriptionelle und translatorische. von dem Wirt erkannte Regulationssi- 

gnale, welche mit einem Fremdgen zur transkriptionellen und translatorischen Expression in dem Wirt 
erkannte Regulationssignale einschlieBt, wobei das Konstrukt in den Wirt eingeftlhrt und das Fremdgen 
zur Bildung des kodierten Polypeptids exprimiert wird, und bei dem fOr die transkriptionelle Regulation 
eine DNA-Sequenz mit ersten und zweiten DomMnen verwendet wird, 
30 wobei die erste Domane sich proximal zu dem Strukturgen befindet, weniger als etwa 200 - 500 bp hat 
und von der S'-nichttranslatierten Region des Hefe-Glyceraldehyd-3-Phosphat-Dehydrogenase- oder 
Pryruvatkinase-Gens abgeleitet ist und sich stromabwSrts bis mindestens zu dem Nucleotid -10 
erstreckt, und 

die zweite DomSne sich distal zu der ersten DomSne und proximal zu dem Strukturgen befindet, 
35 welches eine prokaryotische Sequenz von mindestens etwa 1000 bp ist, wobei die das Konstrukt 
enthaltenden Zellen bis zu einer hohen Dichte gezOchtet werden, und 
das Polypeptid isoliert wird. 

5. In einem Verfahren zur Herstellung eines Polypeptids in einem Hefewirt, wobei das Polypeptidheterolog 
40 zu der Hefe ist und in niedrigen prozentualen Mengen des Gesamtproteins hergestellt wird, eine 

Verbesserung, die folgendes umfaBt: 

VerknQpfung einer DNA-Sequenz in offenem Leseraster, die fur das Polypeptid kodiert, mit einer DNA- 
Sequenz eines zweiten offenen Leserasters, welche fOr heterologe menschliche Superoxiddismutase 
kodiert, unter Bildung eines Fusionspolypeptids; 
45 EinfOhrung der fUr das Fusionspolypeptid unter Bedingung der Expression kodierenden Sequenz in die 
Hefe, welche eine induzierbare transkriptionelle Initiations-Regulationsregion einschlieBt, welche im 
wesentlichen aus, gelesen in 5'- S'-Richtung, eine Promoterregion fOr eine glycolytisches Enzym aus 
Hefe und eine benachbarte Alkohol Dehydrogenase-2-Kontrollregion besteht, und 
Isolation des Fusionspolypeptids unter Bildung des Polypeptids in hoher Ausbeute. 

50 

Revendlcations 

1. Proc6d6 d'obtention de Texpression d'un g&ne structurel Stranger dans une levure hote employ ant des 
constructions hybrides faisant appel & des signaux r^gulateurs de transcription et de traduction 
55 reconnus par ledit hote soud§ h un gfene Stranger pour la regulation de la transcription et de la 
traduction de ^expression dans ledit hote, dans lequel ladite construction est introduce dans ledit hote 
et ledit g&ne Stranger est exprimS pour produire le polypeptide cod6, comprenant : 

Pemploi, pour la regulation de la transcription, d'une sequence d'ADN poss6dant un premier 
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domains et un second domaine, 

(edit premier domaine 6tant proximal dudit g&ne structurel, pr^sentant moins d'environ 600 pb et 
poss^dant une sequence qui renferme un site de liaison d'ARN polymerase et un site d'initiation de la 
transcription d'un g£ne de levure glyc6rald6hyde-3-phosphate d^shydrog^nase; 
5 ledit second domaine etant proximal dudit premier domaine et distal dudit gfene structurel, 

permettant d'ameiiorer I'efficacite de la transcription et formant la region r6gulatrice d'un g&ne de 
levure reguie autre que ledit gene nature! dudit premier domaine; et 

ladite region rggulatrice etant la region r^gulatrice PH05. 

w 2. Precede selon la revendication 1, dans lequel ledit polypeptide est I'antigfene de surface de ('hepatite 
B, la superoxyde dismutase ou Pen -antitry psine. 

3. Proc6d6 selon la revendication 1, dans lequel ladite region rSgulatrice est la region r^gulatrice ADHII. 

75 4. Procede d'obtention de I'expression d'un polypeptide Stranger dans une levure hote employant une 
construction hyrbide faisant appel a des signaux rfgulateurs de transcription et de traduction reconnus 
par ledit hote soude a un g&ne Stranger pour la regulation de la transcription et de la traduction de 
I'expression dans ledit hdte, dans lequel ladite construction est introduite dans ledit h6te et ledit g&ne 
Stranger est exprime pour produire le polypeptide code, comprenant I'emploi, pour la regulation de la 
20 transcription, d'une sequence d'ADN pos$6dant un premier domaine et un second domaine, 

ledit premier domaine etant proximal dudit g£ne structurel, pr#sentant environ 200-500 pb, etant 
derive de la region 5-non traduite du gfene de levure glyceraldehyde-3-phosphate deshydrogenase ou 
pyruvate kinase et se prolongeant en aval au moins jusqu'au nucleotide -10; et 

ledit second domaine etant distal dudit premier domaine et proximal dudit gfene structurel, etant 
25 une sequence procaryote d'au moins environ 1000 pb, dans lequel lesdites cellules de levure contenant 
ledit produit de recombinaison sont cultiv£es jusqu'a une haute density et 
I'isolement dudit polypeptide. 

5. Procede de preparation d'un polypeptide dans une levure h6te, dans laquelle le polypeptide est 
30 heterologue de la levure et peut etre produit en faible pourcentage de la proteine totale, comprenant : 
la soudure d'une sequence d'ADN a cadre de lecture ouvert codant pour ledit polypeptide avec 
une seconde sequence d'ADN a cadre de lecture ouvert codant pour la superoxyde dismutase 
humaine heterologue, pour former un polypeptide de fusion; 

('introduction de la sequence codant pour ledit polypeptide de fusion, dans des conditions 
35 d'expression dans ladite levure qui renferme une region regulatrice de Hnitiation de la transcription 
pouvant etre induite, qui consiste essentiellement a lire dans le sens 5' a 3' une region promoteur 
d'enzyme glycolytique de levure et une region adjacente de controle de I'alcool deshydrogenase-2, ce 
qui permet d'exprimer ledit polypeptide de fusion; 

I'isolement dudit polypeptide de fusion pour obtenir le polypeptide avec un rendement eleve. 
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